Hugging Face
hyunw3/qwen-2.5-0.5b-r1-countdown_lr1.0e-6
Text Generation
Transformers
TensorBoard
Safetensors
13 languages
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
qwen-2.5-0.5b-r1-countdown_lr1.0e-6
Commit History
Improve language tag (#1)
a62dc51
verified
hyunw3
lbourdois
committed on
Jun 3
Model save
8e3814a
verified
hyunw3
committed on
Feb 1
Training in progress, step 450
685b33a
verified
hyunw3
committed on
Feb 1
Training in progress, step 400
af1cce2
verified
hyunw3
committed on
Feb 1
Training in progress, step 350
577db40
verified
hyunw3
committed on
Feb 1
Training in progress, step 300
f7a274c
verified
hyunw3
committed on
Feb 1
Training in progress, step 250
28e3048
verified
hyunw3
committed on
Feb 1
Training in progress, step 225
68e351a
verified
hyunw3
committed on
Feb 1
Training in progress, step 200
1502881
verified
hyunw3
committed on
Feb 1
Training in progress, step 175
5013186
verified
hyunw3
committed on
Feb 1
Training in progress, step 150
f8a4808
verified
hyunw3
committed on
Feb 1
Training in progress, step 125
c573e44
verified
hyunw3
committed on
Feb 1
Training in progress, step 50
2a0ef61
verified
hyunw3
committed on
Feb 1
Training in progress, step 25
266c023
verified
hyunw3
committed on
Feb 1
initial commit
908c2fd
verified
hyunw3
committed on
Feb 1