Muennighoff/Qwen2.5-1.5B-hl-false-v4
Text Generation
Transformers
Safetensors
simplescaling/openaimath
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main: Qwen2.5-1.5B-hl-false-v4/model.safetensors
Commit History
Model save (069ef98, verified): Muennighoff committed on Jun 11
Training in progress, step 1440 (8da9d0e, verified): Muennighoff committed on Jun 11
Training in progress, step 1280 (7c94740, verified): Muennighoff committed on Jun 10
Training in progress, step 1120 (705458c, verified): Muennighoff committed on Jun 10
Training in progress, step 960 (7dcaba8, verified): Muennighoff committed on Jun 10
Training in progress, step 800 (938e9ce, verified): Muennighoff committed on Jun 10
Training in progress, step 640 (0a8dd9c, verified): Muennighoff committed on Jun 10
Training in progress, step 480 (ad2b71c, verified): Muennighoff committed on Jun 10
Training in progress, step 320 (018350e, verified): Muennighoff committed on Jun 10
Training in progress, step 160 (6734e35, verified): Muennighoff committed on Jun 10