Is it possible to download only the MTP gguf (the <1GB one) to use with existing ggufs?

#3
by CHNtentes - opened

Or do I need to download the whole new gguf with MTP inside?

Sure! You can download just the MTP gguf from https://huggingface.co/havenoammo/Qwen3.6-35B-A3B-MTP-GGUF/resolve/main/35BA3B-MTP.gguf and the convert.py script from https://huggingface.co/havenoammo/Qwen3.6-35B-A3B-MTP-GGUF/resolve/main/convert.py, then graft the MTP layers onto your existing model by following this recipe:

# Create and activate a virtual environment
uv venv .venv --seed
source .venv/bin/activate

# Install the gguf library
uv pip install gguf

# Run the grafting script: convert.py <base model> <MTP source> <output>
python convert.py Qwen3.6-27B-UD-Q4_K_XL.gguf MTP-Q8_0.gguf Qwen3.6-27B-MTP-UD-Q4_K_XL.gguf
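To sanity-check the result, you can compare the headers of the base file and the grafted file. Below is a minimal sketch, not part of convert.py: it assumes a little-endian GGUF file of version 2 or later, where the header is the magic bytes `GGUF`, a uint32 version, a uint64 tensor count, and a uint64 metadata key/value count. The helper names `read_gguf_header` and `gained_tensors` are hypothetical, and the expectation that the grafted file has more tensors than the base is an assumption about how the script works.

```python
import struct


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) from a GGUF header.

    Assumes little-endian GGUF v2 or later (64-bit counts).
    """
    with open(path, "rb") as f:
        header = f.read(24)
    if len(header) < 24 or header[:4] != b"GGUF":
        raise ValueError(f"{path} does not look like a GGUF file")
    # uint32 version, uint64 tensor count, uint64 metadata key/value count
    version, tensor_count, kv_count = struct.unpack("<IQQ", header[4:24])
    return version, tensor_count, kv_count


def gained_tensors(base_path, merged_path):
    """Return how many more tensors the merged file declares than the base."""
    _, base_count, _ = read_gguf_header(base_path)
    _, merged_count, _ = read_gguf_header(merged_path)
    return merged_count - base_count
```

With the file names from the command above, `gained_tensors("Qwen3.6-27B-UD-Q4_K_XL.gguf", "Qwen3.6-27B-MTP-UD-Q4_K_XL.gguf")` should return a positive number if the MTP tensors were appended. This only reads the declared counts in the header; it does not validate the tensor data itself.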

Thanks!

So if I have an IQ4_NL quant from UD, I can still patch it with MTP-Q8_0.gguf?

Yes, that's right.
