Qwen-1.5B-L44-Flat / all_results.json
ZMC2019
Model save
df0c248 verified
217 Bytes
{
    "total_flos": 6105625239486464.0,
    "train_loss": 0.028776329486367866,
    "train_runtime": 7678.0521,
    "train_samples": 93733,
    "train_samples_per_second": 17.891,
    "train_steps_per_second": 1.118
}
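
The two rate fields can be cross-checked against `train_runtime` with a little arithmetic. The sketch below is illustrative only. It assumes the rates are totals divided by `train_runtime` (in seconds), which is the usual convention for files like this but is not stated in the file itself. The function name `derive_throughput` and its output keys are hypothetical, and the results are approximate because the rates are rounded to three decimals.

```python
import json

# The metrics exactly as they appear in all_results.json.
RESULTS_JSON = """
{
    "total_flos": 6105625239486464.0,
    "train_loss": 0.028776329486367866,
    "train_runtime": 7678.0521,
    "train_samples": 93733,
    "train_samples_per_second": 17.891,
    "train_steps_per_second": 1.118
}
"""


def derive_throughput(results):
    """Back out approximate totals from the reported rates.

    Assumption (not confirmed by the file): each *_per_second field
    is a total count divided by train_runtime.
    """
    runtime = results["train_runtime"]
    samples_per_second = results["train_samples_per_second"]
    steps_per_second = results["train_steps_per_second"]

    samples_processed = samples_per_second * runtime
    optimizer_steps = steps_per_second * runtime

    return {
        # Total samples seen during training (rate * time).
        "approx_samples_processed": samples_processed,
        # Total training steps (rate * time).
        "approx_optimizer_steps": optimizer_steps,
        # Samples consumed per step, i.e. an effective batch size.
        "approx_samples_per_step": samples_per_second / steps_per_second,
        # Samples processed relative to the reported dataset size.
        "approx_passes_over_data": samples_processed / results["train_samples"],
        # Average wall-clock seconds per step.
        "approx_seconds_per_step": 1.0 / steps_per_second,
    }


derived = derive_throughput(json.loads(RESULTS_JSON))

for name, value in derived.items():
    print(f"{name}: {value:.3f}")
```

Under that assumption, the reported rates imply roughly 16 samples per step. The file does not record the batch size or epoch count, so treat the derived values as sanity checks and not as the run's actual configuration.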