---
language:
  - en
license: apache-2.0
library_name: llama-cpp
tags:
  - gguf
  - vision
  - multimodal
  - forkjoin-ai
base_model: Qwen/Qwen3-VL-2B-Instruct
pipeline_tag: image-text-to-text
---

# Qwen3 Vl 2B Instruct

Forkjoin.ai conversion of [Qwen/Qwen3-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-2B-Instruct) to GGUF format for edge deployment.

## Model Details

- **Source Model**: [Qwen/Qwen3-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-2B-Instruct)
- **Format**: GGUF
- **Converted by**: [Forkjoin.ai](https://forkjoin.ai)

## Usage

### With llama.cpp

```bash
./llama-cli -m Qwen3VL-2B-Instruct-Q8_0.gguf --mmproj mmproj-Qwen3VL-2B-Instruct-Q8_0.gguf -p "Your prompt here" -n 256
```

### With Ollama

Create a `Modelfile`:
```
FROM ./Qwen3VL-2B-Instruct-Q8_0.gguf
```

```bash
ollama create qwen3-vl-2b-instruct-gguf -f Modelfile
ollama run qwen3-vl-2b-instruct-gguf
```

## About Forkjoin.ai

[Forkjoin.ai](https://forkjoin.ai) runs AI models at the edge -- in-browser, on-device, zero cloud cost. These converted models power real-time inference, speech recognition, and natural language capabilities.

All conversions are optimized for edge deployment within browser and mobile memory constraints.

## License

Apache 2.0 (follows upstream model license)