Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
adeelahmad
/
Qwen2-0.5B-GRPO-test
like
0
Transformers
TensorBoard
Safetensors
AI-MO/NuminaMath-TIR
Generated from Trainer
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
Qwen2-0.5B-GRPO-test
Commit History
End of training
4591a4b
verified
adeelahmad
commited on
Mar 28
Model save
fab823e
verified
adeelahmad
commited on
Mar 28
Training in progress, step 113
f6de16f
verified
adeelahmad
commited on
Mar 28
Training in progress, step 110
e7868d6
verified
adeelahmad
commited on
Mar 27
Training in progress, step 100
7306808
verified
adeelahmad
commited on
Mar 27
Training in progress, step 90
4f8ee9f
verified
adeelahmad
commited on
Mar 27
Training in progress, step 80
13cba19
verified
adeelahmad
commited on
Mar 27
Training in progress, step 70
dda68d1
verified
adeelahmad
commited on
Mar 27
Training in progress, step 60
37cf49b
verified
adeelahmad
commited on
Mar 27
Training in progress, step 50
38c2bb0
verified
adeelahmad
commited on
Mar 27
Training in progress, step 40
528ee90
verified
adeelahmad
commited on
Mar 27
Training in progress, step 30
eedfd4b
verified
adeelahmad
commited on
Mar 27
Training in progress, step 20
9e01b3b
verified
adeelahmad
commited on
Mar 27
Training in progress, step 10
0eb17ef
verified
adeelahmad
commited on
Mar 27
initial commit
a4a7458
verified
adeelahmad
commited on
Mar 27