Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Jforeverss
/
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
like
0
Text Generation
Transformers
Safetensors
open-r1/OpenR1-Math-220k
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Commit History
End of training
3ee313b
verified
Jforeverss
commited on
29 days ago
Model save
111d851
verified
Jforeverss
commited on
29 days ago
Training in progress, epoch 1
f1086dd
verified
Jforeverss
commited on
29 days ago
End of training
c15a16e
verified
Jforeverss
commited on
May 9
Model save
17ba808
verified
Jforeverss
commited on
May 9
Training in progress, epoch 1
12a4d7e
verified
Jforeverss
commited on
May 9
End of training
e2865ee
verified
Jforeverss
commited on
May 5
Model save
8b0966a
verified
Jforeverss
commited on
May 5
Training in progress, epoch 1
e6409a8
verified
Jforeverss
commited on
May 5
End of training
49dec69
verified
Jforeverss
commited on
Apr 30
Model save
bea6f2a
verified
Jforeverss
commited on
Apr 30
Training in progress, epoch 1
4aee0eb
verified
Jforeverss
commited on
Apr 30
End of training
530464a
verified
Jforeverss
commited on
Apr 29
Model save
4304937
verified
Jforeverss
commited on
Apr 29
Training in progress, epoch 1
7256090
verified
Jforeverss
commited on
Apr 29
End of training
b4a60bb
verified
Jforeverss
commited on
Apr 28
Model save
ea8a694
verified
Jforeverss
commited on
Apr 28
Training in progress, epoch 1
56eb322
verified
Jforeverss
commited on
Apr 28
End of training
48db4d2
verified
Jforeverss
commited on
Apr 27
Model save
60801ac
verified
Jforeverss
commited on
Apr 27
Training in progress, epoch 1
8002090
verified
Jforeverss
commited on
Apr 27
End of training
703e187
verified
Jforeverss
commited on
Apr 27
Model save
dddfa20
verified
Jforeverss
commited on
Apr 27
Training in progress, epoch 1
13a2dd6
verified
Jforeverss
commited on
Apr 27
End of training
2bd0882
verified
Jforeverss
commited on
Apr 26
Model save
6a7d737
verified
Jforeverss
commited on
Apr 26
Training in progress, epoch 1
6b6d082
verified
Jforeverss
commited on
Apr 26
End of training
77f11d6
verified
Jforeverss
commited on
Apr 25
Model save
a29363d
verified
Jforeverss
commited on
Apr 25
Training in progress, step 25
65b75c4
verified
Jforeverss
commited on
Apr 25
End of training
8d39ed5
verified
Jforeverss
commited on
Apr 24
Model save
4ae392f
verified
Jforeverss
commited on
Apr 24
Training in progress, step 25
634384a
verified
Jforeverss
commited on
Apr 24
Training in progress, step 1000
b7dffd4
verified
Jforeverss
commited on
Apr 24
initial commit
6fa5771
verified
Jforeverss
commited on
Apr 18