qwen3-06.B-sft / train_results.json
vanek-epfl's picture
Model save
641104f verified
{
"epoch": 1.97196261682243,
"total_flos": 4568119699832832.0,
"train_loss": 2.4587962728626325,
"train_runtime": 293.5611,
"train_samples": 1000,
"train_samples_per_second": 1.458,
"train_steps_per_second": 0.361
}