Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2140.8
TFLOPS
1205
68
63
Quentin Gallouédec
PRO
qgallouedec
Follow
jsulz's profile picture
alabebop's profile picture
gpbhupinder's profile picture
228 followers
·
81 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
updated
a model
6 days ago
trl-internal-testing/tiny-Llama4ForCausalLM
published
a model
6 days ago
trl-internal-testing/tiny-Llama4ForCausalLM
updated
a model
7 days ago
qgallouedec/Qwen-2.5-7B-Simple-RL
View all activity
Organizations
Articles
5
Article
283
Open R1: Update #3
Article
304
Open-R1: Update #1
View all Articles
Papers
4
arxiv:
2402.09844
arxiv:
2402.03046
arxiv:
2208.14928
arxiv:
2106.13687
spaces
3
Sort: Recently updated
Runtime error
1
Run Hello World
👀
Runtime error
Run DuckDB Jobs
🦆
Process datasets with DuckDB SQL
Running
12
Train Memory
📈
Generate memory forecast for ML models
models
725
Sort: Recently updated
qgallouedec/Qwen-2.5-7B-Simple-RL
Text Generation
•
Updated
7 days ago
•
5
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
8 days ago
•
5
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
19 days ago
qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
21 days ago
qgallouedec/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
about 1 month ago
•
10
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-packing
Image-Text-to-Text
•
Updated
Mar 14
•
17
qgallouedec/gemma-3-12b-it-codeforces-SFT
Image-Text-to-Text
•
Updated
Mar 14
•
116
•
5
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-no-packing
Image-Text-to-Text
•
Updated
Mar 14
•
39
qgallouedec/gemma-3-4b-it-codeforces-SFT
Image-Text-to-Text
•
Updated
Mar 13
•
131
•
3
qgallouedec/gemma-3-27b-it-codeforces-SFT
Image-Text-to-Text
•
Updated
Mar 13
•
35
•
4
Expand 725 models
datasets
67
Sort: Recently updated
qgallouedec/trl-metrics
Viewer
•
Updated
16 days ago
•
98.7k
•
230
•
1
qgallouedec/prm800k
Viewer
•
Updated
Dec 17, 2024
•
41.2k
•
71
•
3
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9, 2024
•
60.9k
•
27
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9, 2024
•
16.6k
•
40
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
30
qgallouedec/lm-human-preferences-sentiment
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
36
qgallouedec/tldr-preference
Viewer
•
Updated
Sep 9, 2024
•
179k
•
28
qgallouedec/tldr
Viewer
•
Updated
Sep 9, 2024
•
130k
•
25
qgallouedec/hh-rlhf-helpful-base
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
30
qgallouedec/hh-rlhf-helpful-base-trl-style
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
34
Expand 67 datasets