alphadl/R1-Distill-1.5B-Qwen-GRPO
Text Generation
Transformers
Safetensors
open-r1/OpenR1-Math-220k
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
Training in progress, step 22500 | 5167418 | verified | alphadl committed on Jun 28
Training in progress, step 22000 | 07df9b6 | verified | alphadl committed on Jun 28
Training in progress, step 21500 | 72096c6 | verified | alphadl committed on Jun 27
Training in progress, step 21000 | d29484f | verified | alphadl committed on Jun 27
Training in progress, step 20500 | 67bcf5d | verified | alphadl committed on Jun 27
Training in progress, step 20000 | fe19cdc | verified | alphadl committed on Jun 27
Training in progress, step 19500 | 4d1e26a | verified | alphadl committed on Jun 27
Training in progress, step 19000 | c45952c | verified | alphadl committed on Jun 26
Training in progress, step 18500 | 6396bc6 | verified | alphadl committed on Jun 26
Training in progress, step 18000 | b93a96e | verified | alphadl committed on Jun 26
Training in progress, step 17000 | c0448d6 | verified | alphadl committed on Jun 25
Training in progress, step 16500 | d69b28d | verified | alphadl committed on Jun 25
Training in progress, step 16000 | f6dace0 | verified | alphadl committed on Jun 25
Training in progress, step 15500 | 4584625 | verified | alphadl committed on Jun 24
Training in progress, step 15000 | ac51c61 | verified | alphadl committed on Jun 24
Training in progress, step 14500 | 65e9cb4 | verified | alphadl committed on Jun 24
Training in progress, step 13500 | 5ae2f25 | verified | alphadl committed on Jun 22
Training in progress, step 13000 | 33a5ea7 | verified | alphadl committed on Jun 21
Training in progress, step 12500 | 3e02af8 | verified | alphadl committed on Jun 21
Training in progress, step 12000 | a17523e | verified | alphadl committed on Jun 21
End of training | 447e170 | verified | alphadl committed on Jun 21
Model save | 0e09188 | verified | alphadl committed on Jun 21
Training in progress, step 11500 | ce30a96 | verified | alphadl committed on Jun 21
Training in progress, step 11000 | 147af32 | verified | alphadl committed on Jun 20
Training in progress, step 10500 | 1441b08 | verified | alphadl committed on Jun 20
Training in progress, step 10000 | d0449c1 | verified | alphadl committed on Jun 20
Training in progress, step 9500 | 8aa00ca | verified | alphadl committed on Jun 20
Training in progress, step 9000 | 9f78f03 | verified | alphadl committed on Jun 19
Training in progress, step 7500 | 14249aa | verified | alphadl committed on Jun 17
Training in progress, step 7000 | e334b21 | verified | alphadl committed on Jun 17
Training in progress, step 6500 | 1f8706a | verified | alphadl committed on Jun 17
Training in progress, step 6000 | dd44429 | verified | alphadl committed on Jun 16
Training in progress, step 5500 | 6952c5f | verified | alphadl committed on Jun 16
Training in progress, step 5000 | 0acbe1c | verified | alphadl committed on Jun 16
Training in progress, step 4500 | 8d37741 | verified | alphadl committed on Jun 15
Training in progress, step 4000 | af100de | verified | alphadl committed on Jun 15
Training in progress, step 3000 | 323c057 | verified | alphadl committed on Jun 14
Training in progress, step 2000 | 9cda67d | verified | alphadl committed on Jun 14
Training in progress, step 1000 | ba3a5e0 | verified | alphadl committed on Jun 13
Training in progress, step 500 | e526f3e | verified | alphadl committed on Jun 13
initial commit | 7659fec | verified | alphadl committed on Jun 13