d1_science_shortest / train_results.json
End of training
a481950 verified
{
    "epoch": 5.0,
    "total_flos": 1.8323350087155057e+18,
    "train_loss": 0.3733112201335942,
    "train_runtime": 21298.2648,
    "train_samples_per_second": 7.418,
    "train_steps_per_second": 0.058
}