How to use from
Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull eepos/Qwen3.6-27B-MTP-IQ5_KS-GGUF
Run and chat with the model
lemonade run user.Qwen3.6-27B-MTP-IQ5_KS-GGUF-{{QUANT_TAG}}
List all available models
lemonade list
Quick Links

EXPERIMENTAL, I'm just a hobbyist, this might be crap. Works with a fairly recent version of ik_llama.cpp only.

GGUF from ubergarm/Qwen3.6-27B-GGUF

MTP layer from Radamanthys11/Qwen3.6-27B-MTP-Q8_0-GGUF

combined with a script by tschunschi: https://huggingface.co/ubergarm/Qwen3.6-27B-GGUF/discussions/2#69f537e8dc0f21b75e58123d

Thanks to all of the above!


Settings to enable:

-mtp --draft-max 2 --draft-p-min 0.0

--draft-max 3 might be good also

Downloads last month
87
GGUF
Model size
27B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for eepos/Qwen3.6-27B-MTP-IQ5_KS-GGUF

Base model

Qwen/Qwen3.6-27B
Quantized
(2)
this model