---
license: apache-2.0
base_model: huihui-ai/Huihui-Qwen3.5-0.8B-abliterated
tags:
  - mnn
  - abliterated
  - uncensored
  - qwen3.5
  - on-device
  - mobile
language:
  - en
library_name: mnn
---

# Qwen3.5-0.8B Uncensored — MNN Format (mix-mask export)

MNN conversion of [huihui-ai/Huihui-Qwen3.5-0.8B-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3.5-0.8B-abliterated) for on-device inference, with the vision encoder included (`visual.mnn`). All credit for the abliteration goes to huihui-ai.

- Hybrid attention: 24 layers, of which 18 are LinearAttention and 6 are full-attention. The export carries self-describing metadata (`attention_type:"mix"`, `layer_nums:24`, `sliding_window:4`) and a stacked mask graph `[2,1,1,seq,kv]`
- Quantization: 4-bit HQQ, block size 64 (matches the taobao-mnn 0.8B bundle parameters)
- Tokenizer: `tokenizer.mtok` (new format)
- Exporter: MNN-TokForge `tokforge/rebase-3.5.0` @ `21c339c2fd` (upstream #4336 attention-type detection)
- Requires an MNN runtime with the mix-mask `gen_attention_mask` path (MNN-TokForge ≥ `tokforge/rebase-3.5.0`); legacy single-mask runtimes are NOT compatible
- Validated: multi-turn coherence (3 cycles/session, CPU), LinearAttention snapshot/restore (Y=Z), self-describing hybrid metadata verified against the exported graph
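
The hybrid metadata listed above can be sanity-checked with a few lines of Python. This is a minimal sketch, not part of the exporter: the key names and values come from the list above, but this card does not say which file holds them, so the caller must supply the metadata as a plain `dict`. The function name `check_hybrid_metadata` is hypothetical.

```python
# Expected values, copied from the metadata keys listed in this card.
EXPECTED_META = {"attention_type": "mix", "layer_nums": 24, "sliding_window": 4}

# Layer split stated in this card: 18 LinearAttention + 6 full-attention.
LINEAR_LAYERS = 18
FULL_LAYERS = 6


def check_hybrid_metadata(meta):
    """Return a list of problems found in `meta`; an empty list means consistent.

    `meta` is a dict the caller has already loaded. Where the exporter
    stores these keys is an assumption left to the caller.
    """
    problems = []
    for key, want in EXPECTED_META.items():
        got = meta.get(key)
        if got != want:
            problems.append(f"{key}: expected {want!r}, got {got!r}")
    # The two layer kinds should add up to the declared layer count.
    if meta.get("layer_nums") != LINEAR_LAYERS + FULL_LAYERS:
        problems.append(
            f"layer split {LINEAR_LAYERS}+{FULL_LAYERS} does not match layer_nums"
        )
    return problems
```

An empty return value only shows that the declared values agree with this card. It does not prove that the runtime supports the mix-mask path.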

## File checksums (md5)

```
3af7de552fb9cea05e67f615d96efb53  llm.mnn
10fe7c624159f62c69d93a0e836fc80f  llm.mnn.weight
88bbc9851fc9e2aae364eaf59698326a  llm_config.json
3c183b4e10bef80909cb6998fe5e4fa6  tokenizer.mtok
06b782392bdb4ca19a1045c3709fdbb6  visual.mnn
e48e427a48dc363278ecaacb7810db6a  visual.mnn.weight
```
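
To confirm a download is intact, the digests above can be compared against the local files. The sketch below uses only the Python standard library. The helper names `md5_of` and `verify_bundle` are illustrative, and the model directory path is whatever location you downloaded the files to.

```python
import hashlib
from pathlib import Path

# Expected digests, copied from the checksum list in this card.
EXPECTED_MD5 = {
    "llm.mnn": "3af7de552fb9cea05e67f615d96efb53",
    "llm.mnn.weight": "10fe7c624159f62c69d93a0e836fc80f",
    "llm_config.json": "88bbc9851fc9e2aae364eaf59698326a",
    "tokenizer.mtok": "3c183b4e10bef80909cb6998fe5e4fa6",
    "visual.mnn": "06b782392bdb4ca19a1045c3709fdbb6",
    "visual.mnn.weight": "e48e427a48dc363278ecaacb7810db6a",
}


def md5_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large weight files are not read into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_bundle(model_dir, expected=EXPECTED_MD5):
    """Return a dict mapping file name to 'ok', 'mismatch', or 'missing'."""
    results = {}
    for name, want in expected.items():
        path = Path(model_dir) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of(path) == want:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results
```

Calling `verify_bundle("path/to/model")` returns one status per file. Any `mismatch` or `missing` entry suggests re-downloading that file. On systems with GNU coreutils, saving the checksum lines to a file and running `md5sum -c` on it performs the same check.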