Qwen4b-SFT-d9-merged-after-dpo / model-00001-of-00002.safetensors

Commit History

(Trained with Unsloth)
8e6f2a2
verified

Rakushaking committed on