Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Reda alami
RedaAlami
Follow
Pent's profile picture
mouadjer's profile picture
21world's profile picture
8 followers
·
3 following
AI & ML interests
Reinforcement Learning
Recent Activity
published
a model
5 days ago
RedaAlami/Qwen-2.5-7B-Simple-RL
published
a model
10 days ago
RedaAlami/Qwen2-0.5B-GRPO-test
updated
a dataset
18 days ago
RedaAlami/merged-dpo-safety
View all activity
Organizations
spaces
1
Sleeping
TestRecommenderSystem
👁
models
13
Sort: Recently updated
RedaAlami/Qwen-2.5-7B-Simple-RL
Updated
5 days ago
RedaAlami/Qwen2-0.5B-GRPO-test
Updated
10 days ago
RedaAlami/zephyr-7b-dpo-qlora
Updated
Oct 4, 2024
•
43
RedaAlami/zephyr-7b-dpo-full
Updated
Aug 29, 2024
RedaAlami/merged-dataset0-dataset1
Updated
Aug 28, 2024
RedaAlami/zephyr-7b-gemma-dpo
Updated
Jul 31, 2024
•
5
RedaAlami/ultrafeedback_binarized_custom2
Updated
Jul 17, 2024
RedaAlami/ultrafeedback_binarized_custom
Updated
Jul 17, 2024
RedaAlami/ultrafeedback_binarized_processed
Updated
Jul 12, 2024
RedaAlami/falcon-11b-instruct-dpo-full
Updated
Jul 1, 2024
Expand 13 models
datasets
139
Sort: Recently updated
RedaAlami/merged-dpo-safety
Viewer
•
Updated
18 days ago
•
3.95k
•
32
RedaAlami/eng-batch-3-dpo-safety_test
Viewer
•
Updated
18 days ago
•
36
•
32
RedaAlami/eng-batch-4-dpo-safety_test
Viewer
•
Updated
18 days ago
•
53
•
36
RedaAlami/eng-batch-5-dpo-safety_test
Viewer
•
Updated
18 days ago
•
63
•
33
RedaAlami/eng-batch-6-dpo-safety_test
Viewer
•
Updated
18 days ago
•
58
•
32
RedaAlami/eng-batch-6-dpo-safety_train
Viewer
•
Updated
18 days ago
•
1.11k
•
34
RedaAlami/eng-batch-5-dpo-safety_train
Viewer
•
Updated
18 days ago
•
977
•
39
RedaAlami/eng-batch-4-dpo-safety_train
Viewer
•
Updated
18 days ago
•
1.06k
•
37
RedaAlami/eng-batch-3-dpo-safety_train
Viewer
•
Updated
18 days ago
•
596
•
32
RedaAlami/hate_lgbtq_v3
Viewer
•
Updated
Dec 10, 2024
•
393
•
60
Expand 139 datasets