---
license: apache-2.0
language: [en, zh]
base_model: Qwen/Qwen3.5-27B
tags: [mlx, mlx-node, quantized, awq, 6-bit, qwen3.5, hybrid-attention, gated-delta-net, apple-silicon, unsloth-dynamic]
library_name: mlx-node
quantized_by: mlx-node
pipeline_tag: text-generation
model_type: qwen3_5
---

# Qwen3.5-27B — UD-Q6_K_XL (mlx-node)

6-bit base mixed-precision quantization of [Qwen/Qwen3.5-27B](https://huggingface.co/Qwen/Qwen3.5-27B) for Apple Silicon via [mlx-node](https://github.com/mlx-node/mlx-node).

| | Original (BF16) | This Model |
|---|---|---|
| **Size** | ~51 GB | **27 GB** |
| **Precision** | BF16 uniform | Mixed 6/8/8/8/8-bit + BF16 |

## All Variants

| Repo | GGUF Equivalent | Size |
|---|---|---|
| [Brooooooklyn/Qwen3.5-27B-UD-Q2_K_XL-mlx](https://huggingface.co/Brooooooklyn/Qwen3.5-27B-UD-Q2_K_XL-mlx) | UD-Q2_K_XL | 15 GB |
| [Brooooooklyn/Qwen3.5-27B-UD-Q3_K_XL-mlx](https://huggingface.co/Brooooooklyn/Qwen3.5-27B-UD-Q3_K_XL-mlx) | UD-Q3_K_XL | 17 GB |
| [Brooooooklyn/Qwen3.5-27B-UD-Q4_K_XL-mlx](https://huggingface.co/Brooooooklyn/Qwen3.5-27B-UD-Q4_K_XL-mlx) | UD-Q4_K_XL | 20 GB |
| [Brooooooklyn/Qwen3.5-27B-UD-Q5_K_XL-mlx](https://huggingface.co/Brooooooklyn/Qwen3.5-27B-UD-Q5_K_XL-mlx) | UD-Q5_K_XL | 24 GB |
| [Brooooooklyn/Qwen3.5-27B-UD-Q6_K_XL-mlx](https://huggingface.co/Brooooooklyn/Qwen3.5-27B-UD-Q6_K_XL-mlx) | UD-Q6_K_XL | 27 GB |
| [Brooooooklyn/Qwen3.5-27B-UD-Q8_K_XL-mlx](https://huggingface.co/Brooooooklyn/Qwen3.5-27B-UD-Q8_K_XL-mlx) | UD-Q8_K_XL | 29 GB |

## Per-Tensor Bit Assignments (N=6)

| Weight | Bits |
|---|---|
| embed_tokens | 8-bit |
| lm_head | 8-bit |
| self_attn.q/k/v_proj | 8-bit + AWQ |
| linear_attn.in_proj_qkv/z | 8-bit + AWQ |
| self_attn.o_proj | bf16 |
| linear_attn.out_proj | bf16 |
| down_proj | 8-bit |
| gate_proj, up_proj | 6-bit |

Based on [Unsloth Dynamic 2.0](https://unsloth.ai/docs/models/qwen3.5/gguf-benchmarks). [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0).
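
The per-tensor bit assignments listed above can be read as a small set of name-matching rules. The following is a minimal sketch of that lookup in plain Python; it is not mlx-node's API or its actual quantization code. The function name `assigned_precision`, the `BIT_RULES` list, and the example tensor paths are hypothetical, and `in_proj_qkv/z` is assumed to mean the two tensors `in_proj_qkv` and `in_proj_z`.

```python
import re
from typing import Optional

# Rules transcribed from the Per-Tensor Bit Assignments table (N=6).
# The first matching pattern wins. "bf16" means the table lists the
# tensor as kept in bf16 rather than quantized.
# Assumption: "in_proj_qkv/z" is read as in_proj_qkv and in_proj_z.
BIT_RULES = [
    (r"embed_tokens", "8-bit"),
    (r"lm_head", "8-bit"),
    (r"self_attn\.[qkv]_proj", "8-bit + AWQ"),
    (r"linear_attn\.in_proj_(qkv|z)", "8-bit + AWQ"),
    (r"self_attn\.o_proj", "bf16"),
    (r"linear_attn\.out_proj", "bf16"),
    (r"down_proj", "8-bit"),
    (r"(gate|up)_proj", "6-bit"),
]


def assigned_precision(tensor_name: str) -> Optional[str]:
    """Return the table's precision label for a tensor name.

    Returns None for tensors the table does not list (for example
    normalization weights); this sketch makes no claim about them.
    """
    for pattern, precision in BIT_RULES:
        if re.search(pattern, tensor_name):
            return precision
    return None


if __name__ == "__main__":
    # Hypothetical tensor paths, used only to exercise the lookup.
    for name in (
        "model.layers.0.mlp.gate_proj.weight",
        "model.layers.0.mlp.down_proj.weight",
        "model.layers.3.self_attn.q_proj.weight",
        "model.layers.3.self_attn.o_proj.weight",
    ):
        print(name, "->", assigned_precision(name))
```

Only the MLP `gate_proj` and `up_proj` weights use the 6-bit base width in this variant; every other listed tensor stays at 8-bit or bf16, which is why the card describes the precision as mixed.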