---
license: apache-2.0
language:
- en
base_model:
- huihui-ai/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated
library_name: mlx
pipeline_tag: image-text-to-text
tags:
- image-text-to-text
- vision
- multimodal
- vlm
- reasoning
- distillation
- chain-of-thought
- qwen
- qwen3.6
- mixture-of-experts
- moe
- lora
- unsloth
- abliterated
- uncensored
- mlx
- apple-silicon
- huihui
- quantized
- mxfp4
- mlx-vlm
inference: false
widget:
- text: Summarize the operational risks in this deployment plan.
  example_title: Reasoning prompt
---

# Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp4

`Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp4` is an `mlx-vlm` checkpoint derived from `huihui-ai/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated`, packaged for local multimodal experimentation on Apple Silicon.

## Tested inference path

> **Inference for this checkpoint has been tested with [`LibraxisAI/mlx-batch-server`](https://github.com/LibraxisAI/mlx-batch-server).**  
> This is the recommended tested path for operator-controlled local multimodal mlx-lm / mlx-vlm inference on Apple Silicon.

| Aspect | Status |
|---|---|
| Tested runtime | `LibraxisAI/mlx-batch-server` |
| Target hardware | Apple Silicon |
| Inference mode | Local / self-hosted |
| Hugging Face Hosted Inference | Disabled for this repository (`inference: false`) |

This does not claim compatibility with every possible serving stack. It documents the path that has been exercised for this published checkpoint.

## Intended use

- Local image-and-text reasoning on Apple Silicon
- Multimodal prompting experiments
- Screenshot, document, chart, and visual question-answering workflows
- Operator-controlled local inference where hosted inference is not desired

## Out of scope

- Safety-critical decisions without domain expert review
- Claims of benchmark superiority not backed by published evaluation data
- Non-MLX runtime guarantees
- High-stakes visual interpretation without human validation

## Training and conversion metadata

| Parameter | Value |
|---|---|
| Repository | `LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp4` |
| Base model | `huihui-ai/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated` |
| Task | `image-text-to-text` |
| Library | `mlx` |
| Format | MLX / VMLX checkpoint |
| Quantization | MXFP4 |
| Target platform | Apple Silicon |

This card reports metadata present in the Hugging Face repository, existing frontmatter, or public config files. Missing benchmark, dataset, or training-run details are left explicit rather than reconstructed.

## Usage

Use the library instructions above, or run this checkpoint through the tested local serving path: [`LibraxisAI/mlx-batch-server`](https://github.com/LibraxisAI/mlx-batch-server)

## Validation

End-to-end pipeline test 2026-04-22 on M3 Ultra (load → text → vision → unload), served via `mlx-batch-server`:

| Probe | TTFT | Output chars | Notes |
|---|---|---|---|
| Cold load | — | — | **21 s** from cold to ready |
| Text — simple greeting (PL) | 0.51 s | 601 | Clean output, abliterated behaviour |
| Text — canonical (PL, literary) | 0.29 s | 718 | Concise reasoning trace |
| Vision — JPEG (Monument Valley) | 6.50 s | 873 | Accurate scene description |

3/3 probes passed. `has_reasoning=True` on all probes — this model emits reasoning traces via `<think>` markers.

## Limitations

- Validate outputs on your own domain data before relying on this checkpoint.
- Memory use and speed depend heavily on Apple Silicon generation, unified-memory size, prompt length, and runtime configuration.
- Validation data above reflects M3 Ultra; expect different timings on other hardware.

## License

`apache-2.0`. Check the upstream/base model license as well when a base model is declared.

---

𝚅𝚒𝚋𝚎𝚌𝚛𝚊𝚏𝚝𝚎𝚍. with AI Agents by VetCoders (c)2024-2026 LibraxisAI