Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Blancy
/
Qwen3-0.6B-Open-R1-GRPO
like
0
Text Generation
Transformers
Safetensors
Blancy/1ktestfrom10kwithdifficultyclasses
qwen3
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
Qwen3-0.6B-Open-R1-GRPO
Commit History
Training in progress, step 36
0e9118b
verified
Blancy
commited on
10 days ago
Training in progress, step 36
4da7567
verified
Blancy
commited on
10 days ago
Training in progress, step 36
085332c
verified
Blancy
commited on
11 days ago
Training in progress, step 36
0952902
verified
Blancy
commited on
11 days ago
End of training
de97277
verified
Blancy
commited on
28 days ago
Model save
1dcb3d0
verified
Blancy
commited on
28 days ago
Training in progress, step 35
6ab119d
verified
Blancy
commited on
28 days ago
Training in progress, step 30
8796db9
verified
Blancy
commited on
28 days ago
Training in progress, step 25
f802044
verified
Blancy
commited on
28 days ago
Training in progress, step 20
a3f0e5c
verified
Blancy
commited on
28 days ago
Training in progress, step 15
5c569e8
verified
Blancy
commited on
28 days ago
Training in progress, step 10
24957d4
verified
Blancy
commited on
28 days ago
Training in progress, step 5
216d05a
verified
Blancy
commited on
29 days ago
Training in progress, step 35
16240d5
verified
Blancy
commited on
29 days ago
Training in progress, step 30
c1d65c8
verified
Blancy
commited on
29 days ago
Training in progress, step 25
fd3a8b2
verified
Blancy
commited on
29 days ago
Training in progress, step 20
2dd21dc
verified
Blancy
commited on
29 days ago
Training in progress, step 15
ceb27c1
verified
Blancy
commited on
29 days ago
Training in progress, step 10
a752bd3
verified
Blancy
commited on
29 days ago
Training in progress, step 5
68a34bd
verified
Blancy
commited on
29 days ago
initial commit
0cc45e5
verified
Blancy
commited on
29 days ago