This one works great (from 32 to 52 tps)

#1
by Trilogix1 - opened

Throughput jumped from 32-35 to 52 tps. The other models can't make much of a difference (still testing, though).

Great job.

Unsloth AI org

Nice!

I'd recommend this quantization - https://github.com/mudler/apex-quant
https://huggingface.co/mudler/Qwen3.6-35B-A3B-APEX-MTP-GGUF - it gave me a boost in both pp (prompt processing) and tg (token generation).
