EXPERIMENTAL FIX... THE MODEL DOESNT HAVE ANY ISSUE... THOUGH IT DOES SEPPOKU... one of the issue we find... not really a massive issue! just follow our recommended setting... this is just a remnant

Uploaded finetuned model

  • Developed by: N-Bot-Int
  • License: agpl-3.0
  • Finetuned from model : N-Bot-Int/mrgrtv1-8b-merged

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
172
Safetensors
Model size
5B params
Tensor type
F32
·
F16
·
U8
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for N-Bot-Int/mrgrtv1-grpo-merged-eos-fix

Quantizations
1 model