Abdelaziz Bounhar
PRO
BounharAbdelaziz
48 followers · 50 following
http://abdelazizbounhar.com/
BounharAbdelaziz
abdelaziz-bounhar-a58910138
AI & ML interests
Deep Learning, Reinforcement Learning, AI Agents, Generative Modeling, NLP, Information Theory, Security of Machine Learning, etc.
Recent Activity
Published a model (2 days ago): BounharAbdelaziz/Qwen2.5-0.5B-DPO-French-Orca
Published a model (2 days ago): BounharAbdelaziz/Qwen2.5-0.5B-DPO-English-Orca
Updated a collection (2 days ago): RLHF
BounharAbdelaziz's activity
Published 2 models (2 days ago)
BounharAbdelaziz/Qwen2.5-0.5B-DPO-French-Orca • Text Generation • Updated 2 days ago
BounharAbdelaziz/Qwen2.5-0.5B-DPO-English-Orca • Text Generation • Updated 2 days ago
Updated a collection (2 days ago)
RLHF • Collection • Some RLHF experiments using GRPO and DPO. • 3 items • Updated 2 days ago
Updated 3 models (2 days ago)
BounharAbdelaziz/Qwen2.5-3B-GRPO-Math-GSM8K • Text Generation • Updated 2 days ago
BounharAbdelaziz/Qwen2.5-0.5B-DPO-French-Orca • Text Generation • Updated 2 days ago
BounharAbdelaziz/Qwen2.5-0.5B-DPO-English-Orca • Text Generation • Updated 2 days ago
Published a model (2 days ago)
BounharAbdelaziz/Qwen2.5-3B-GRPO-Math-GSM8K • Text Generation • Updated 2 days ago
Updated a model (2 days ago)
BounharAbdelaziz/Qwen2.5-3B-GRPO-GSM8K-old • Text Generation • Updated 2 days ago
Published a model (3 days ago)
BounharAbdelaziz/Qwen2.5-3B-GRPO-GSM8K-old • Text Generation • Updated 2 days ago
Liked 2 datasets (7 days ago)
AIffl/french_orca_dpo_pairs • Viewer • Updated May 26, 2024 • 12.7k • 93 • 6
AIffl/french_hh_rlhf • Viewer • Updated Jun 15, 2024 • 169k • 141 • 4
Liked 2 datasets (16 days ago)
a-m-team/AM-Thinking-v1-Distilled • Preview • Updated 15 days ago • 5.55k • 29
a-m-team/AM-Qwen3-Distilled • Preview • Updated May 22 • 2.73k • 11
Liked a model (17 days ago)
QCRI/Fanar-1-9B-Instruct • Text Generation • Updated 22 days ago • 3.54k • 21
Liked a dataset (18 days ago)
open-thoughts/OpenThoughts3-1.2M • Viewer • Updated 18 days ago • 1.2M • 21k • 112