qingyangzhang/Qwen2.5-3B-Random-Natural-Reasoning
Text Generation
Transformers
Safetensors
qingyangzhang/natural_reasoning_simple
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
End of training | 77f34a7 | verified | qingyangzhang | committed on Jun 2
Model save | eec0948 | verified | qingyangzhang | committed on Jun 2
Training in progress, step 125 | eb7234e | verified | qingyangzhang | committed on Jun 2
Training in progress, step 120 | 4c1e7e1 | verified | qingyangzhang | committed on Jun 2
Training in progress, step 110 | 8607141 | verified | qingyangzhang | committed on Jun 2
Training in progress, step 100 | 1837d1d | verified | qingyangzhang | committed on Jun 2
Training in progress, step 90 | af5fffa | verified | qingyangzhang | committed on Jun 2
Training in progress, step 80 | 8b41320 | verified | qingyangzhang | committed on Jun 2
Training in progress, step 70 | 549075d | verified | qingyangzhang | committed on Jun 2
Training in progress, step 60 | 9c9304a | verified | qingyangzhang | committed on Jun 2
Training in progress, step 50 | e17c0d7 | verified | qingyangzhang | committed on Jun 2
Training in progress, step 40 | 35a7e5c | verified | qingyangzhang | committed on Jun 2
Training in progress, step 30 | fb3fad4 | verified | qingyangzhang | committed on Jun 2
Training in progress, step 20 | 30492c8 | verified | qingyangzhang | committed on Jun 2
Training in progress, step 10 | 3f98c13 | verified | qingyangzhang | committed on Jun 2
initial commit | d2510e6 | verified | qingyangzhang | committed on Jun 2