Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Muennighoff
/
Qwen2.5-1.5B-hl-false-v7
like
0
Text Generation
Transformers
Safetensors
simplescaling/openaimath
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen2.5-1.5B-hl-false-v7
Commit History
End of training
6bcc6e0
verified
Muennighoff
commited on
Jun 17
Model save
dbedc48
verified
Muennighoff
commited on
Jun 17
Training in progress, step 1440
b871ce3
verified
Muennighoff
commited on
Jun 17
Training in progress, step 1280
d3258d5
verified
Muennighoff
commited on
Jun 17
Training in progress, step 1120
94e549b
verified
Muennighoff
commited on
Jun 17
Training in progress, step 960
c6d4f33
verified
Muennighoff
commited on
Jun 17
Training in progress, step 800
c1eb749
verified
Muennighoff
commited on
Jun 17
Training in progress, step 640
1364ff9
verified
Muennighoff
commited on
Jun 17
Training in progress, step 480
56042d2
verified
Muennighoff
commited on
Jun 17
Training in progress, step 320
ff1b650
verified
Muennighoff
commited on
Jun 17
initial commit
b4c88f8
verified
Muennighoff
commited on
Jun 16