qwen2.5-3b-unsloth-bnb-4bit-grpo-gsm8k / adapter_model.safetensors

Commit History

End of training
9585fc1
verified

mohitmayank commited on