qwen3-8b-grpo-v2-epoch1 / model-00001-of-00004.safetensors

Commit History

upload phase1_v2judge_run02_Qwen3_8b_3epoch / global_step_17 (merged)
24664b3
verified

gabriel-xiong committed on