a1_code_rosetta / all_results.json
gsmyrnis's picture
End of training
13c5bb3 verified
{
"epoch": 5.0,
"total_flos": 1603895020290048.0,
"train_loss": 0.4141314175447472,
"train_runtime": 23163.7501,
"train_samples_per_second": 6.821,
"train_steps_per_second": 0.053
}