qingyangzhang/Qwen2.5-3B-GRPO-Natural-Reasoning
Text Generation
Transformers
Safetensors
qingyangzhang/natural_reasoning_simple
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
End of training (7dfcb0b, verified): qingyangzhang committed on May 2
Model save (6a6be3d, verified): qingyangzhang committed on May 2
Training in progress, step 125 (412bf1e, verified): qingyangzhang committed on May 2
Training in progress, step 120 (296afaa, verified): qingyangzhang committed on May 2
Training in progress, step 110 (ab31c01, verified): qingyangzhang committed on May 1
Training in progress, step 100 (ddf34c3, verified): qingyangzhang committed on May 1
Training in progress, step 90 (35dd503, verified): qingyangzhang committed on May 1
Training in progress, step 80 (d9f0355, verified): qingyangzhang committed on May 1
Training in progress, step 70 (24d6b3c, verified): qingyangzhang committed on May 1
Training in progress, step 60 (6a55acc, verified): qingyangzhang committed on May 1
Training in progress, step 50 (f63a39a, verified): qingyangzhang committed on May 1
Training in progress, step 40 (4025e72, verified): qingyangzhang committed on May 1
Training in progress, step 30 (c0b4b7b, verified): qingyangzhang committed on May 1
Training in progress, step 20 (7301ccb, verified): qingyangzhang committed on May 1
Training in progress, step 10 (4051f7d, verified): qingyangzhang committed on May 1
Training in progress, step 10 (e0118a3, verified): qingyangzhang committed on May 1
End of training (4249d13, verified): qingyangzhang committed on Apr 30
Model save (54e1ea9, verified): qingyangzhang committed on Apr 30
Training in progress, step 125 (7e44cab, verified): qingyangzhang committed on Apr 30
Training in progress, step 120 (08fc469, verified): qingyangzhang committed on Apr 30
Training in progress, step 110 (3b71ac6, verified): qingyangzhang committed on Apr 30
Training in progress, step 100 (770fb41, verified): qingyangzhang committed on Apr 30
Training in progress, step 90 (6be43e0, verified): qingyangzhang committed on Apr 30
Training in progress, step 80 (b5d1baa, verified): qingyangzhang committed on Apr 30
Training in progress, step 70 (e692cf0, verified): qingyangzhang committed on Apr 30
Training in progress, step 60 (3e6cd49, verified): qingyangzhang committed on Apr 30
Training in progress, step 50 (8b28b1c, verified): qingyangzhang committed on Apr 30
Training in progress, step 40 (c3729e0, verified): qingyangzhang committed on Apr 30
Training in progress, step 30 (0beb2db, verified): qingyangzhang committed on Apr 30
Training in progress, step 20 (762df7c, verified): qingyangzhang committed on Apr 30
Training in progress, step 10 (7b2bd57, verified): qingyangzhang committed on Apr 30
End of training (a3e3774, verified): qingyangzhang committed on Apr 28
Model save (0491de0, verified): qingyangzhang committed on Apr 28
Training in progress, step 125 (ea7d1c4, verified): qingyangzhang committed on Apr 28
Training in progress, step 120 (f11d0cd, verified): qingyangzhang committed on Apr 28
Training in progress, step 110 (4bb36d1, verified): qingyangzhang committed on Apr 28
Training in progress, step 100 (88d4b0e, verified): qingyangzhang committed on Apr 28
Training in progress, step 90 (1f7ea58, verified): qingyangzhang committed on Apr 28
Training in progress, step 80 (48328e6, verified): qingyangzhang committed on Apr 28
Training in progress, step 70 (82bed5e, verified): qingyangzhang committed on Apr 28
Training in progress, step 60 (20afed4, verified): qingyangzhang committed on Apr 28
Training in progress, step 50 (e1cec40, verified): qingyangzhang committed on Apr 28
Training in progress, step 40 (5c6122d, verified): qingyangzhang committed on Apr 28
Training in progress, step 30 (1b337a1, verified): qingyangzhang committed on Apr 28
Training in progress, step 20 (1340761, verified): qingyangzhang committed on Apr 28
Training in progress, step 10 (51bc79d, verified): qingyangzhang committed on Apr 28
initial commit (3925ef2, verified): qingyangzhang committed on Apr 27