Abdelaziz Bounhar PRO

BounharAbdelaziz

AI & ML interests

Deep Learning, Reinforcement Learning, AI Agents, Generative Modeling, NLP, Information Theory, Security of Machine Learning, ...etc

Recent Activity

published a model 2 days ago

BounharAbdelaziz/Qwen2.5-0.5B-DPO-French-Orca

published a model 2 days ago

BounharAbdelaziz/Qwen2.5-0.5B-DPO-English-Orca

updated a collection 2 days ago

RLHF

View all activity

Organizations

published 2 models 2 days ago

BounharAbdelaziz/Qwen2.5-0.5B-DPO-French-Orca

Text Generation • Updated 2 days ago

BounharAbdelaziz/Qwen2.5-0.5B-DPO-English-Orca

Text Generation • Updated 2 days ago

updated a collection 2 days ago

RLHF

Collection

Some RLHF experiments using GRPO and DPO. • 3 items • Updated 2 days ago

updated 3 models 2 days ago

published a model 2 days ago

BounharAbdelaziz/Qwen2.5-3B-GRPO-Math-GSM8K

Text Generation • Updated 2 days ago

updated a model 2 days ago

BounharAbdelaziz/Qwen2.5-3B-GRPO-GSM8K-old

Text Generation • Updated 2 days ago

published a model 3 days ago

BounharAbdelaziz/Qwen2.5-3B-GRPO-GSM8K-old

Text Generation • Updated 2 days ago

liked 2 datasets 7 days ago

AIffl/french_orca_dpo_pairs

Viewer • Updated May 26, 2024 • 12.7k • 93 • 6

AIffl/french_hh_rlhf

Viewer • Updated Jun 15, 2024 • 169k • 141 • 4

liked 2 datasets 16 days ago

a-m-team/AM-Thinking-v1-Distilled

Preview • Updated 15 days ago • 5.55k • 29

a-m-team/AM-Qwen3-Distilled

Preview • Updated May 22 • 2.73k • 11

liked a model 17 days ago

QCRI/Fanar-1-9B-Instruct

Text Generation • Updated 22 days ago • 3.54k • 21

liked a dataset 18 days ago

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated 18 days ago • 1.2M • 21k • 112

Abdelaziz Bounhar PRO

AI & ML interests

Recent Activity

Organizations

BounharAbdelaziz's activity