qwen3-8b-grpo-v6-epoch2 / model-00001-of-00004.safetensors

Commit History

upload phase1_v6judge_run01_Qwen3_8b_3epoch / global_step_34 (merged)
248e2a7
verified

gabriel-xiong commited on