DeepSeek-R1-Distill-Qwen-7B-GRPO-v8 / model-00003-of-00004.safetensors

Commit History

Training in progress, step 385
01d412e
verified

Kadins commited on

Training in progress, step 350
5fd8273
verified

Kadins commited on

Training in progress, step 300
d79b464
verified

Kadins commited on

Training in progress, step 250
28c076c
verified

Kadins commited on

Training in progress, step 200
479fbbc
verified

Kadins commited on

Training in progress, step 150
159415f
verified

Kadins commited on

Training in progress, step 100
b9565e5
verified

Kadins commited on

Training in progress, step 50
b5a5bb7
verified

Kadins commited on