train_qqp_1753094139 / train_results.json
rbelanec's picture
End of training
8c69a36 verified
raw
history blame contribute delete
253 Bytes
{
"epoch": 10.0,
"num_input_tokens_seen": 250787112,
"total_flos": 1.1314919730737775e+19,
"train_loss": 0.03974567347884284,
"train_runtime": 164298.4475,
"train_samples_per_second": 19.931,
"train_steps_per_second": 4.983
}