gemma-2-2b-it-grpo-gsm8k / model-00001-of-00002.safetensors

Commit History

2 training epochs
9068f69
verified

lmassaron commited on

1024 training steps
e540d3b
verified

lmassaron commited on