fieldvalley-llm2025/llm2025_main_merged_dpo02

Final DPO Round 2 model. Optimized to strictly suppress code fences and explanations in JSON output.

Downloads last month
1
Safetensors
Model size
4B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for fieldvalley-llm2025/llm2025_main_merged_dpo02