Training runs

#1
by Voidreaper2026 - opened

Hi,

Great work on the Mythos model! I've been working on a similar cybersecurity fine-tune (4B, Qwen3 base) and I'm seeing the same over-inflated severity behaviour yours exhibits: CVEs coming out rated higher than they should be.

I'm currently on epoch 1 and about to run a second to see if that tightens the calibration. What dataset did you train on, and how many epochs did you run? It would be great to compare notes.

Thanks

Ask the creator please, we are quantizing, not training =)
