---
license: apache-2.0
base_model: Wan-AI/Wan2.2-T2V-A14B-Diffusers
pipeline_tag: text-to-video
library_name: mlx-gen
tags:
- mlx
- mlx-gen
- mflux
- apple-silicon
- 8-bit
- wan
- wan2.2
- video-generation
- text-to-video
- wan-a14b
---
# wan2.2-t2v-a14b-diffusers-8bit

This repository contains MLX-Gen saved weights for `Wan-AI/Wan2.2-T2V-A14B-Diffusers`. The checkpoint is designed for local Apple Silicon inference with [`mlx-gen`](https://github.com/lpalbou/mlx-gen).

The checkpoint uses the mflux/MLX saved-weight layout and includes MLX quantization tensors. It cannot be loaded with Diffusers or Transformers `from_pretrained()`.

## Source Model

Original model: [`Wan-AI/Wan2.2-T2V-A14B-Diffusers`](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers).

## License and Access

This quantized derivative follows the Apache 2.0 license of the source model.

## Quantization

This is an MLX q8 checkpoint for Wan2.2 A14B. MLX-Gen uses 8-bit quantization for Wan modules where MLX supports quantization:

- q8 for quantizable Wan transformer attention and feed-forward modules.
- BF16 for the Wan VAE.
- BF16 for Wan transformer conditioning/output projection linears, the UMT5 text encoder, scheduler metadata, tokenizer files, norms, convolutions, and other non-quantizable parameters.
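
The policy in the list above can be sketched as a predicate over module paths. This is only an illustration, not MLX-Gen's actual code: the path fragments `attn` and `ffn` and the function name `wants_q8` are hypothetical, and the real Wan module names may differ. The one detail taken from MLX itself is that quantizable layers expose a `to_quantized` method.

```python
# Hypothetical path fragments; the real Wan module names in MLX-Gen may differ.
QUANTIZED_PATH_HINTS = ("attn", "ffn")


def wants_q8(path: str, module: object) -> bool:
    """Return True when a module would be stored as q8 under the policy above."""
    # MLX layers that support quantization expose a `to_quantized` method.
    quantizable = hasattr(module, "to_quantized")
    # Only modules nested under an attention or feed-forward block qualify.
    components = path.split(".")
    in_quantized_block = any(hint in components for hint in QUANTIZED_PATH_HINTS)
    return quantizable and in_quantized_block


# With MLX installed, a predicate of this shape could be passed as:
#   mlx.nn.quantize(model, group_size=64, bits=8, class_predicate=wants_q8)
```

Everything the predicate rejects (norms, convolutions, conditioning and output projections) would simply keep its original BF16 weights.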

Wan q4 quality and any possible mixed q4/q8 policy are still under validation. Prefer q8 for publishable Wan checkpoints until the q4 policy is documented.

See the [MLX-Gen quantization docs](https://github.com/lpalbou/mlx-gen/blob/main/docs/quantization.md) for compatibility notes.

## Local Validation

These measurements are validation-sized release checks for this uploaded package. They verify package loading, video integrity, and prompt influence for this profile only; they do not claim full-size `1280x720`, 81-frame, 40-step readiness.

| Measurement | Value |
|---|---:|
| Package disk usage | 39.5 GiB |
| Validation profile | 384x224, 33 frames, 12 steps, 8.0 fps, seed 4242, `--low-ram` |
| Prompt pair | scientist scene / red car scene |
| Video health | 33 / 33 frames decoded, 8.0 fps, nonblank |
| Mean temporal delta | 5.6 / 3.2 luma |
| Prompt delta | 102.0 mean abs RGB |
| Generation time | 162.2 s / 319.6 s |
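
The card does not define how the two delta metrics are computed. The sketch below shows one plausible reading, assuming that "mean temporal delta" is the mean absolute luma change between consecutive frames of one video and that "prompt delta" is the mean absolute RGB difference between same-index frames of the two videos. The function names are hypothetical, and frames are assumed to be raw 8-bit buffers (for example, as decoded by an external tool).

```python
from statistics import fmean
from typing import Sequence


def mean_abs_diff(a: bytes, b: bytes) -> float:
    """Mean absolute difference between two equally sized raw 8-bit buffers."""
    if len(a) != len(b):
        raise ValueError("buffers must have the same size")
    return fmean(abs(x - y) for x, y in zip(a, b))


def mean_temporal_delta(luma_frames: Sequence[bytes]) -> float:
    """Average luma change between consecutive frames of one video (assumed definition)."""
    pairs = zip(luma_frames, luma_frames[1:])
    return fmean(mean_abs_diff(prev, cur) for prev, cur in pairs)


def prompt_delta(rgb_a: Sequence[bytes], rgb_b: Sequence[bytes]) -> float:
    """Average RGB difference between same-index frames of two videos (assumed definition)."""
    if len(rgb_a) != len(rgb_b):
        raise ValueError("videos must have the same number of frames")
    return fmean(mean_abs_diff(a, b) for a, b in zip(rgb_a, rgb_b))


def is_nonblank(luma_frame: bytes) -> bool:
    """A frame counts as nonblank here if its pixels are not all one value."""
    return len(set(luma_frame)) > 1
```

Under these definitions, a temporal delta near zero would indicate a frozen video, and a prompt delta near zero would indicate that the prompt had little effect on the output for a fixed seed.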

## Compatibility

Requires `mlx-gen >= 0.18.9`.

Generated with `mlx-gen 0.18.9`.

For new MLX-Gen projects, use `mlxgen` as both the command name and the Python import path.

## Usage

The q8 A14B example below is intentionally validation-sized. Do not use this card to claim full-size `1280x720`, 81-frame, 40-step readiness until that exact path has passed video integrity and quality validation.

```bash
python -m pip install -U mlx-gen

mlxgen download --model AbstractFramework/wan2.2-t2v-a14b-diffusers-8bit

mlxgen generate \
  --model AbstractFramework/wan2.2-t2v-a14b-diffusers-8bit \
  --task text-to-video \
  --prompt "Your video prompt here" \
  --width 384 \
  --height 224 \
  --frames 33 \
  --steps 12 \
  --guidance 4 \
  --guidance-2 3 \
  --fps 8 \
  --seed 4242 \
  --metadata \
  --output video.mp4
```
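
To run the same validation-sized profile for several prompts, the command above can be assembled from a script. The sketch below only builds the argument list; the helper name `build_generate_command` is hypothetical, and every flag and value is copied from the shell example rather than from a documented Python API.

```python
MODEL = "AbstractFramework/wan2.2-t2v-a14b-diffusers-8bit"


def build_generate_command(prompt: str, output: str, seed: int = 4242) -> list[str]:
    """Build the `mlxgen generate` argument list for the validation-sized profile."""
    options = {
        "--model": MODEL,
        "--task": "text-to-video",
        "--prompt": prompt,
        "--width": 384,
        "--height": 224,
        "--frames": 33,
        "--steps": 12,
        "--guidance": 4,
        "--guidance-2": 3,
        "--fps": 8,
        "--seed": seed,
    }
    command = ["mlxgen", "generate"]
    for flag, value in options.items():
        command += [flag, str(value)]
    # `--metadata` takes no value; `--output` comes last, as in the shell example.
    command += ["--metadata", "--output", output]
    return command
```

The resulting list can be passed to `subprocess.run(command, check=True)`, which avoids shell quoting problems when a prompt contains quotes or spaces.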

## Attribution

MLX-Gen is based on [mflux](https://github.com/filipstrand/mflux) by Filip Strand and the original mflux contributors. This model card is generated by MLX-Gen so derived checkpoints keep that attribution visible.

Quantized and contributed by [@lpalbou](https://huggingface.co/lpalbou).