BS-riche-Qwen2.5-lora-2 / train_results.json
jpraysz's picture
Upload 20 files
620b87d verified
raw
history blame contribute delete
259 Bytes
{
"epoch": 2.9904761904761905,
"num_input_tokens_seen": 8039616,
"total_flos": 3.746792787278561e+17,
"train_loss": 0.0682682229922368,
"train_runtime": 5945.5431,
"train_samples_per_second": 0.53,
"train_steps_per_second": 0.033
}