Model quant requests?

#1
by pathosethoslogos - opened

Been loving this model!

Just out of curiosity, do you take requests? It would be amazing if same quant approach was applied to the new Qwen3.6. 😊

On it, had it on my mind a few hours ago

Could I please ask you to create a high-quality quantization for the Qwen3.5-397B-A17B model?
Intel's current version fits into a dual-Spark setup with full context, but it uses uncalibrated RTN quantization.
As a result, it sometimes performs worse than the 122B model.

https://huggingface.co/Intel/Qwen3.5-397B-A17B-int4-AutoRound
"The model is quantized via RTN mode"

Sign up or log in to comment