How is the model supposed to be deployed with vLLM?

by diuibyang - opened Aug 14, 2025

Aug 14, 2025

When I tried to deploy the model with vLLM, I got the following error: AttributeError: Model DotsOCRForCausalLM does not support BitsAndBytes quantization yet. No 'packed_modules_mapping' found

ekanshthakur

Aug 22, 2025

I ran into the same issue.

helizac

Owner Aug 24, 2025

Hello, sorry for the late response.

Currently, vLLM does not support dots.ocr (DotsOCRForCausalLM). If that changes, I can update this model or release a similar one that works with vLLM as well.

You can follow the progress here:
https://huggingface.co/rednote-hilab/dots.ocr/discussions/20

NoneLand

Sep 26, 2025

Same issue. It seems the PR was merged into vLLM 4 days ago.

NoneLand

Sep 28, 2025

The latest vLLM nightly wheel supports dots_ocr and can apply bitsandbytes quantization in-flight.
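
As a rough illustration of the in-flight approach mentioned above, the Python sketch below shows how vLLM's offline LLM class might be pointed at a dots.ocr checkpoint with bitsandbytes quantization. The helper names build_llm_kwargs and load_llm are hypothetical, and trust_remote_code=True is an assumption about how the checkpoint is packaged; neither is confirmed in this thread. Older vLLM releases may additionally need load_format="bitsandbytes".

```python
from typing import Any, Dict


def build_llm_kwargs(model_id: str) -> Dict[str, Any]:
    """Collect keyword arguments for vllm.LLM with bitsandbytes quantization.

    `model_id` is whatever Hugging Face repo id or local path you are loading.
    """
    return {
        "model": model_id,
        # Ask vLLM to quantize the weights with bitsandbytes while loading.
        "quantization": "bitsandbytes",
        # Assumption: the checkpoint relies on custom code shipped in the repo.
        "trust_remote_code": True,
    }


def load_llm(model_id: str):
    """Create the vLLM engine (hypothetical helper; needs vLLM and a GPU)."""
    # Imported lazily so build_llm_kwargs can be inspected without vLLM installed.
    from vllm import LLM

    return LLM(**build_llm_kwargs(model_id))
```

To try it, call load_llm with the repo id or local path of the checkpoint you are deploying. Whether this works depends on running a vLLM build recent enough to include dots_ocr support.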

miracleyin

Jan 2

I tried loading this model with vLLM 0.13.0, but I get an error like: self.data.shape == loaded_weight.shape

kalle07

Apr 13

Any idea how I can run this via llama_cpp?
There seems to be no compatible chat handler...
