Qwen4b-SFT-d9-merged-after-dpo / model-00002-of-00002.safetensors

Commit History

(Trained with Unsloth)
ea955f1
verified

Rakushaking committed on