jasonhuang3
/
dpo-llama-3-1-8b-math-ep3

Model card Files Files and versions Metrics Training metrics Community