QwenLeanSFT_0326 / train_results.json
WhiteGiverPlus's picture
Upload folder using huggingface_hub
49de29c verified
raw
history blame contribute delete
225 Bytes
{
"epoch": 1.9968798751950079,
"total_flos": 2.5563512438029025e+18,
"train_loss": 0.12861698282261688,
"train_runtime": 5934.2279,
"train_samples_per_second": 10.368,
"train_steps_per_second": 0.162
}