math_llama3_reset_dpo_100_0_0.83 / trainer_state.json

Commit History

Upload folder using huggingface_hub
8da0bf5
verified

lzc0525 commited on