How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="pbatra/Llama-3.2-11B-Vision-Instruct-GGUF")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("pbatra/Llama-3.2-11B-Vision-Instruct-GGUF", dtype="auto")
Quick Links

Llama-3.2-11B-Vision-Instruct

This repository contains quantized versions of the model from the original repository: meta-llama/Llama-3.2-11B-Vision-Instruct.

Name Quantization Method Size (GB)
Downloads last month
641
GGUF
Model size
10B params
Architecture
mllama
Hardware compatibility
Log In to add your hardware

4-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for pbatra/Llama-3.2-11B-Vision-Instruct-GGUF

Quantized
(20)
this model