---
base_model: mikoy92/Unlimited-OCR-bf16-mlx
license: mit
language:
- multilingual
pipeline_tag: image-text-to-text
tags:
- mlx
- mlx-vlm
- ocr
- vision-language
- baidu
- deepseekocr
- quantized
- 8-bit
- affine
library_name: mlx
---

# Unlimited-OCR 8-bit MLX

This is an 8-bit affine MLX quantization of [`mikoy92/Unlimited-OCR-bf16-mlx`](https://huggingface.co/mikoy92/Unlimited-OCR-bf16-mlx), converted with `mlx-vlm`.

Quantization settings:

- mode: `affine`
- bits: `8`
- group size: `64`
- observed effective bits per weight during conversion: `9.408`

Because this is a vision-language OCR model, `mlx-vlm` does not quantize every multimodal tensor to 8 bits, so the effective bits per weight can exceed the nominal 8.

## Usage

```bash
pip install -U mlx-vlm

mlx_vlm.generate \
  --model mikoy92/Unlimited-OCR-8bit-mlx \
  --image /path/to/image.png \
  --prompt "Extract all readable text from this image." \
  --max-tokens 512 \
  --temperature 0
```

## Validation

Before upload, this checkpoint was loaded locally with `mlx_vlm.generate` and produced OCR text and table output on a document-image smoke test.
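
As a rough cross-check of the effective bits-per-weight figure reported under the quantization settings, the sketch below estimates what share of the weights would have to remain unquantized to arrive at `9.408`. It rests on two assumptions that this card does not state: that each quantization group stores one 16-bit scale and one 16-bit bias, and that tensors left unquantized are kept at 16 bits. The function and constant names are illustrative only and are not part of `mlx-vlm`.

```python
# Rough cross-check of the reported effective bits per weight.
# Assumptions (not stated in this card): each group stores a 16-bit scale
# and a 16-bit bias, and unquantized tensors are kept at 16 bits.

BITS = 8
GROUP_SIZE = 64
OBSERVED_BPW = 9.408

SCALE_BIAS_BITS = 2 * 16  # assumed: one 16-bit scale + one 16-bit bias per group
UNQUANTIZED_BITS = 16     # assumed: remaining weights stay at 16 bits


def quantized_bpw(bits: int, group_size: int) -> float:
    """Bits per weight of an affine-quantized tensor, including per-group overhead."""
    return bits + SCALE_BIAS_BITS / group_size


def unquantized_fraction(observed: float, quantized: float) -> float:
    """Fraction f of weights left at 16 bits such that
    (1 - f) * quantized + f * UNQUANTIZED_BITS == observed."""
    return (observed - quantized) / (UNQUANTIZED_BITS - quantized)


q = quantized_bpw(BITS, GROUP_SIZE)
f = unquantized_fraction(OBSERVED_BPW, q)
print(f"quantized layers: {q} bits per weight")
print(f"implied share of weights left at 16 bits: {f:.1%}")
```

Under these assumptions the quantized layers cost 8.5 bits per weight, and roughly 12% of the weights would need to stay at 16 bits to reach the observed figure. This is an estimate derived from the reported number, not a measurement of the checkpoint.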