Quentin Gallouédec
PRO
qgallouedec
329 followers · 265 following
QGallouedec
qgallouedec.bsky.social
Recent Activity
Updated a model about 8 hours ago: trl-internal-testing/tiny-Qwen3ForCausalLM
Updated a model about 8 hours ago: trl-internal-testing/tiny-Qwen2ForCausalLM-2.5
Updated a model about 9 hours ago: trl-internal-testing/small-Qwen2ForCausalLM-2.5
Organizations
Articles
7
Article
51
No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL
Article
37
Gotchas in Tokenizer Behavior Every Developer Should Know
Papers
4
arxiv:2402.09844
arxiv:2402.03046
arxiv:2208.14928
arxiv:2106.13687
Spaces
5
Sleeping • Tmp 🚀
Runtime error • 2 • Run Hello World 👀
Sleeping • Compute 👁
Runtime error • Run DuckDB Jobs 🦆 • Process datasets with DuckDB SQL
Running • 14 • Train Memory 📈 • Generate memory forecast for ML models
Models
732
qgallouedec/Qwen3-1.7B-SFT • Updated 6 days ago
qgallouedec/Qwen3-0.6B-SFT • Updated 6 days ago
qgallouedec/Qwen2.5-0.5B-SFT • Updated 21 days ago
qgallouedec/SmolLM2-360M-Rickified-GRPO • Text Generation • Updated 24 days ago • 62 • 1
qgallouedec/SmolLM2-360M-Rickified • Text Generation • Updated 25 days ago • 1.67k
qgallouedec/SmolLM2-360M-SFT • Text Generation • Updated May 9 • 13
qgallouedec/R1-Zero-Qwen-7B-Math • Text Generation • Updated May 1 • 131
qgallouedec/Qwen-2.5-7B-Simple-RL • Text Generation • Updated Apr 8 • 14
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated Apr 7 • 26
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated Mar 26
Datasets
72
qgallouedec/trl-metrics • Viewer • Updated 18 days ago • 120k • 300 • 1
qgallouedec/rick-physics-grpo • Viewer • Updated 24 days ago • 1.79k • 226 • 1
qgallouedec/rick-science • Viewer • Updated 29 days ago • 1.18k • 187 • 1
qgallouedec/physics-problems • Viewer • Updated May 10 • 247 • 38
qgallouedec/rick-teaches-math • Viewer • Updated May 10 • 6.8k • 39
qgallouedec/DAPO-Math-17k-Processed-Scored • Viewer • Updated Apr 29 • 16.4k • 47 • 2
qgallouedec/prm800k • Viewer • Updated Dec 17, 2024 • 41.2k • 74 • 3
qgallouedec/ultrafeedback-prompt • Viewer • Updated Sep 9, 2024 • 60.9k • 67
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness • Viewer • Updated Sep 9, 2024 • 16.6k • 41
qgallouedec/lm-human-preferences-descriptiveness • Viewer • Updated Sep 9, 2024 • 6.26k • 49