Qwen3.6-35B-A3B PARO full4096-e5 — legacy/original format

This is the original ParoQuant export of Qwen/Qwen3.6-35B-A3B, produced from the full4096-e5 calibration run.

  • Format: legacy/original ParoQuant safetensors export
  • Quantization: W4A16 ParoQuant, bits=4, group_size=128, krot=8
  • model.safetensors: 23,284,714,104 bytes
  • Artifact bits per weight (BPW): 5.3222, computed as file size in bits over a nominal 35B-parameter denominator
  • Contains the original duplicate fp16 .weight fallback tensors for modules that also have .qweight
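
The BPW figure can be reproduced from the file size listed above. This is a minimal arithmetic sketch; the function name is illustrative, and the denominator is the nominal 35B stated in the list, not an exact parameter count.

```python
# Figures taken from the list above.
FILE_BYTES = 23_284_714_104       # size of model.safetensors
NOMINAL_PARAMS = 35_000_000_000   # the "35B denominator" (nominal, not exact)


def bits_per_weight(file_bytes: int, n_params: int) -> float:
    """Total bits stored in the file divided by the parameter count."""
    return file_bytes * 8 / n_params


bpw = bits_per_weight(FILE_BYTES, NOMINAL_PARAMS)
print(f"{bpw:.4f}")  # 5.3222
```

Because the numerator counts every byte in the file, including the duplicate fp16 fallback tensors, the result sits above the nominal 4 bits of the quantized weights.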

A fully packed version with those duplicate fallback tensors removed is available separately at:

Quality reference

Canonical tx4/quality3 evaluation against the original BF16 HF model:

Model             PPL ↓    ΔNLL ↓      KL (nats) ↓   Top-1 % ↑
PARO full4096-e5  6.6216   +0.009506   0.034684      92.000
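
For readers unfamiliar with these columns, the sketch below shows how such metrics are commonly computed from per-token logits of a reference and a quantized model. It is not the tx4/quality3 harness: the function names are hypothetical, and it assumes ΔNLL is the mean per-token NLL difference in nats, KL is KL(reference ‖ quantized) averaged over tokens, and Top-1 is argmax agreement with the reference model.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one token's logits."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]


def argmax(values):
    return max(range(len(values)), key=values.__getitem__)


def compare(ref_logits, q_logits, targets):
    """Return (ppl_q, delta_nll, mean_kl, top1_agreement_pct).

    ref_logits / q_logits: one list of logits per token position.
    targets: the ground-truth token id at each position.
    Definitions here are assumptions, not the card's exact harness.
    """
    n = len(targets)
    nll_ref = nll_q = kl = 0.0
    agree = 0
    for r, q, t in zip(ref_logits, q_logits, targets):
        lr, lq = log_softmax(r), log_softmax(q)
        nll_ref -= lr[t]
        nll_q -= lq[t]
        # KL(ref || quantized) in nats for this position
        kl += sum(math.exp(a) * (a - b) for a, b in zip(lr, lq))
        agree += argmax(r) == argmax(q)
    ppl_q = math.exp(nll_q / n)
    return ppl_q, (nll_q - nll_ref) / n, kl / n, 100.0 * agree / n
```

Under these definitions, identical logits give ΔNLL = 0, KL = 0 and Top-1 = 100, so each column measures drift from the BF16 reference.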

Notes

This artifact requires a ParoQuant-compatible loader/runtime; it is not a plain unquantized Transformers checkpoint.
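
One way to confirm which layout a downloaded file uses, without any ParoQuant code, is to read the safetensors header directly. The sketch below relies only on the safetensors file layout (an 8-byte little-endian header length followed by a JSON header) and on the .qweight/.weight suffixes mentioned above; the helper names are hypothetical and this is not part of the ParoQuant tooling.

```python
import json
import struct


def read_safetensors_header(path):
    """Return the tensor table from a safetensors file without loading tensor data."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))  # little-endian u64
        header = json.loads(f.read(header_len))
    header.pop("__metadata__", None)  # optional free-form metadata entry
    return header


def modules_with_fallback(header):
    """List module prefixes that carry both a .qweight and a .weight tensor."""
    names = set(header)
    suffix = ".qweight"
    return sorted(
        name[: -len(suffix)]
        for name in names
        if name.endswith(suffix) and name[: -len(suffix)] + ".weight" in names
    )
```

A non-empty result from modules_with_fallback(read_safetensors_header("model.safetensors")) would indicate the legacy layout with duplicate fallback tensors described above.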
