llama3.1-8b-grpo-test / model-00004-of-00004.safetensors

Commit History

Trained with Unsloth
88c41ba
verified

JUNGU commited on