Qwen2.5-1.5B-Open-R1-Distill / train_results.json
Model save
3c0e154 verified
231 Bytes
{
  "epoch": 1.0,
  "total_flos": 76916824473600.0,
  "train_loss": 0.7337305870281874,
  "train_runtime": 1235.0855,
  "train_samples": 16610,
  "train_samples_per_second": 17.497,
  "train_steps_per_second": 0.137
}
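
The throughput fields above can be cross-checked against the runtime with a little arithmetic. The sketch below is illustrative only: the helper name `derive_throughput` and the derived key names are hypothetical, and it assumes `train_runtime` is in seconds and that the two rates were averaged over that same runtime. Because the rates are rounded to three decimals in the file, the derived figures are approximations.

```python
import json

# Metrics copied verbatim from train_results.json above.
RAW = """
{
  "epoch": 1.0,
  "total_flos": 76916824473600.0,
  "train_loss": 0.7337305870281874,
  "train_runtime": 1235.0855,
  "train_samples": 16610,
  "train_samples_per_second": 17.497,
  "train_steps_per_second": 0.137
}
"""


def derive_throughput(metrics: dict) -> dict:
    """Back out approximate totals from the per-second rates.

    Hypothetical helper: assumes train_runtime is in seconds and that both
    rates were averaged over that same runtime.
    """
    runtime = metrics["train_runtime"]
    samples_per_s = metrics["train_samples_per_second"]
    steps_per_s = metrics["train_steps_per_second"]
    return {
        # rate * time gives an approximate count
        "approx_steps": steps_per_s * runtime,
        "approx_samples_processed": samples_per_s * runtime,
        # ratio of the two rates approximates samples consumed per step
        "approx_samples_per_step": samples_per_s / steps_per_s,
    }


metrics = json.loads(RAW)
derived = derive_throughput(metrics)
for name, value in derived.items():
    print(f"{name}: {value:.1f}")
```

Under these assumptions the run comes out to roughly 169 optimizer steps at about 128 samples per step. The implied sample count (about 21,610) does not equal the reported `train_samples` of 16,610; the file does not say why, so the two fields should not be treated as interchangeable.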