Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Blancy
/
Qwen2.5-1.5B-Open-R1-GRPO
like
0
Text Generation
Transformers
Safetensors
Blancy/1ktestfrom10kwithdifficultyclasses-without-difficult
qwen3
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
main
Qwen2.5-1.5B-Open-R1-GRPO
/
trainer_state.json
Commit History
Model save
287364c
verified
Blancy
commited on
May 24
Model save
74066ff
verified
Blancy
commited on
May 24
Model save
6fffe1f
verified
Blancy
commited on
May 23
Model save
dcf8c80
verified
Blancy
commited on
May 23
Model save
000fe44
verified
Blancy
commited on
May 23
Model save
772bf21
verified
Blancy
commited on
May 23
Model save
3bffed5
verified
Blancy
commited on
May 22
Model save
7465454
verified
Blancy
commited on
May 22
Model save
54afeb1
verified
Blancy
commited on
May 10
Model save
e6197ad
verified
Blancy
commited on
May 7
Model save
63c75d3
verified
Blancy
commited on
May 7
Model save
68e305f
verified
Blancy
commited on
Apr 28
Model save
fde291d
verified
Blancy
commited on
Apr 26
Model save
c8f38c5
verified
Blancy
commited on
Apr 25
Model save
19bccf7
verified
Blancy
commited on
Apr 25
Model save
66cc85d
verified
Blancy
commited on
Apr 24
Model save
6bd399a
verified
Blancy
commited on
Apr 24
Model save
40b9b95
verified
Blancy
commited on
Apr 24
Model save
72ca9e7
verified
Blancy
commited on
Apr 23
Model save
2b49f5e
verified
Blancy
commited on
Apr 20
Model save
87d48a8
verified
Blancy
commited on
Apr 20
Model save
973b618
verified
Blancy
commited on
Apr 18
Model save
d51bd8e
verified
Blancy
commited on
Apr 15
Training in progress, epoch 0
8e5ebb3
verified
Blancy
commited on
Apr 15
Model save
20a123d
verified
Blancy
commited on
Apr 15