a1_code_opencodereasoning / train_results.json
gsmyrnis's picture
End of training
f68fc85 verified
{
"epoch": 5.0,
"total_flos": 3167726939537408.0,
"train_loss": 0.46034835494964227,
"train_runtime": 30235.0271,
"train_samples_per_second": 5.226,
"train_steps_per_second": 0.041
}