Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
longlian
/
Qwen2-0.5B-GRPO-peft-demo
like
0
Transformers
Safetensors
Generated from Trainer
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen2-0.5B-GRPO-peft-demo
Commit History
Model save
5b2b878
verified
longlian
commited on
Feb 14
Training in progress, step 226
270e691
verified
longlian
commited on
Feb 14
Training in progress, step 220
b916db4
verified
longlian
commited on
Feb 14
Training in progress, step 210
81bf694
verified
longlian
commited on
Feb 14
Training in progress, step 200
4a84a47
verified
longlian
commited on
Feb 14
Training in progress, step 190
4b46313
verified
longlian
commited on
Feb 14
Training in progress, step 180
0d09d20
verified
longlian
commited on
Feb 14
Training in progress, step 170
246b71e
verified
longlian
commited on
Feb 14
Training in progress, step 160
af0cf36
verified
longlian
commited on
Feb 14
Training in progress, step 150
b2ba9b8
verified
longlian
commited on
Feb 14
Training in progress, step 140
f55c7b5
verified
longlian
commited on
Feb 14
Training in progress, step 130
350f11a
verified
longlian
commited on
Feb 14
Training in progress, step 120
3feb19a
verified
longlian
commited on
Feb 14
Training in progress, step 110
3213da8
verified
longlian
commited on
Feb 14
Training in progress, step 100
0286a68
verified
longlian
commited on
Feb 14
Training in progress, step 90
bbab02a
verified
longlian
commited on
Feb 14
Training in progress, step 80
0e30370
verified
longlian
commited on
Feb 14
Training in progress, step 70
90ba596
verified
longlian
commited on
Feb 14
Training in progress, step 60
5fabde9
verified
longlian
commited on
Feb 14
Training in progress, step 50
a6c02bb
verified
longlian
commited on
Feb 14
Training in progress, step 40
54a0ea1
verified
longlian
commited on
Feb 14
Training in progress, step 30
b31d6bb
verified
longlian
commited on
Feb 14
Training in progress, step 20
77108ec
verified
longlian
commited on
Feb 14
Training in progress, step 10
5d99cbd
verified
longlian
commited on
Feb 14
Model save
469a29f
verified
longlian
commited on
Feb 14
Training in progress, step 113
0afaa1b
verified
longlian
commited on
Feb 14
Training in progress, step 110
9d207b5
verified
longlian
commited on
Feb 14
Training in progress, step 100
246fc0e
verified
longlian
commited on
Feb 14
Training in progress, step 90
28209ea
verified
longlian
commited on
Feb 14
Training in progress, step 80
642491e
verified
longlian
commited on
Feb 14
Training in progress, step 70
211a9e1
verified
longlian
commited on
Feb 14
Training in progress, step 60
15343dd
verified
longlian
commited on
Feb 14
Training in progress, step 50
389522c
verified
longlian
commited on
Feb 14
Training in progress, step 40
414f24e
verified
longlian
commited on
Feb 14
Training in progress, step 30
2d28129
verified
longlian
commited on
Feb 14
Training in progress, step 20
c1ecec9
verified
longlian
commited on
Feb 14
Training in progress, step 10
45f9918
verified
longlian
commited on
Feb 14
Training in progress, step 90
a6dfc04
verified
longlian
commited on
Feb 14
Training in progress, step 80
f34e207
verified
longlian
commited on
Feb 14
Training in progress, step 70
8a4d5bf
verified
longlian
commited on
Feb 14
Training in progress, step 60
e4224bd
verified
longlian
commited on
Feb 14
Training in progress, step 50
17d5152
verified
longlian
commited on
Feb 14
Training in progress, step 40
dced446
verified
longlian
commited on
Feb 14
Training in progress, step 30
bbdd3ec
verified
longlian
commited on
Feb 14
Training in progress, step 20
5ea82da
verified
longlian
commited on
Feb 14
Training in progress, step 10
f91f181
verified
longlian
commited on
Feb 14
initial commit
e20c78f
verified
longlian
commited on
Feb 14