--- base_model: mikoy92/Unlimited-OCR-bf16-mlx license: mit language: - multilingual pipeline_tag: image-text-to-text tags: - mlx - mlx-vlm - ocr - vision-language - baidu - deepseekocr - quantized - 4-bit - affine library_name: mlx --- # Unlimited-OCR 4-bit MLX This is a 4-bit affine MLX quantization of [`mikoy92/Unlimited-OCR-bf16-mlx`](https://huggingface.co/mikoy92/Unlimited-OCR-bf16-mlx), converted with `mlx-vlm`. Quantization settings: - mode: `affine` - bits: `4` - group size: `64` - observed effective bits per weight during conversion: `5.883` Because this is a vision-language OCR model, `mlx-vlm` does not aggressively quantize every multimodal tensor; the effective bits-per-weight can be higher than exactly 4-bit. ## Usage ```bash pip install -U mlx-vlm mlx_vlm.generate \ --model mikoy92/Unlimited-OCR-4bit-mlx \ --image /path/to/image.png \ --prompt "Extract all readable text from this image." \ --max-tokens 512 \ --temperature 0 ``` ## Validation Before upload, this checkpoint was loaded locally with `mlx_vlm.generate` and produced OCR text/table output on a document-image smoke test.