Qwen3.5 27B Aconite v0 — Q6_K GGUF

Q6_K quantization of trashpanda-org/qwen3.5-27b-aconite-v0.

Quant Details

Property Value
Source trashpanda-org/qwen3.5-27b-aconite-v0
Quant Q6_K (6.56 BPW)
Size ~21 GB
Format GGUF (llama.cpp)
Original Precision bf16

Usage

Load with any llama.cpp-compatible runtime (llama.cpp, KoboldCpp, ollama, LM Studio, etc.):

llama-cli -m qwen3.5-27b-aconite-v0-Q6_K.gguf -p "Your prompt here"

Notes

  • Quantized from the bf16 source weights using llama.cpp's convert_hf_to_gguf.pyllama-quantize
  • Q6_K preserves near-original quality at ~41% of the bf16 size
  • Fits comfortably on 2×T4 (32 GB) without CPU offload
Downloads last month
38
GGUF
Model size
27B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

6-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for finis-est/qwen3.5-27b-aconite-v0-Q6_K

Base model

Qwen/Qwen3.5-27B
Quantized
(1)
this model