pot-r1-grpo-qwen2.5-1.5b-Instruct-wo-warmup / model-00001-of-00002.safetensors

Commit History

Upload Qwen2ForCausalLM
67e13b4
verified

ZhuofengLi commited on