Llama-3.2-3B-Instruct-Open-R1-GRPO-2 / model-00001-of-00002.safetensors

Commit History

Training in progress, epoch 2
774717b
verified

zztheaven committed on

Training in progress, epoch 1
34dd16a
verified

zztheaven committed on