Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW / model-00001-of-00002.safetensors

Commit History

Training in progress, epoch 1
7494e4f
verified

Lansechen commited on

Training in progress, epoch 0
1ced99c
verified

Lansechen commited on