Muennighoff/Qwen2.5-1.5B-hl-baseline-v10
Text Generation
Transformers
Safetensors
simplescaling/openaimath
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
47bc16b  End of training (verified; Muennighoff committed 28 days ago)
9e6f069  Model save (verified; Muennighoff committed 28 days ago)
ab525b2  Training in progress, step 1440 (verified; Muennighoff committed 28 days ago)
43cfd84  Training in progress, step 1280 (verified; Muennighoff committed 28 days ago)
576a03d  Training in progress, step 1120 (verified; Muennighoff committed 29 days ago)
f010beb  Training in progress, step 960 (verified; Muennighoff committed 29 days ago)
71e1c6d  Training in progress, step 800 (verified; Muennighoff committed 29 days ago)
86195ad  Training in progress, step 640 (verified; Muennighoff committed 29 days ago)
99f8b67  Training in progress, step 480 (verified; Muennighoff committed 29 days ago)
ffb934c  Training in progress, step 320 (verified; Muennighoff committed 29 days ago)
896436b  Training in progress, step 160 (verified; Muennighoff committed 29 days ago)
c09967e  initial commit (verified; Muennighoff committed 29 days ago)