---
language:
- en
license: apache-2.0
library_name: llama-cpp
tags:
- gguf
- vision
- multimodal
- forkjoin-ai
base_model: Qwen/Qwen3-VL-2B-Instruct
pipeline_tag: image-text-to-text
---

# Qwen3 Vl 2B Instruct

Forkjoin.ai conversion of [Qwen/Qwen3-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-2B-Instruct) to GGUF format for edge deployment.

## Model Details

- **Source Model**: [Qwen/Qwen3-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-2B-Instruct)
- **Format**: GGUF
- **Converted by**: [Forkjoin.ai](https://forkjoin.ai)

## Usage

### With llama.cpp

```bash
./llama-cli -m Qwen3VL-2B-Instruct-Q8_0.gguf --mmproj mmproj-Qwen3VL-2B-Instruct-Q8_0.gguf -p "Your prompt here" -n 256
```

### With Ollama

Create a `Modelfile`:

```
FROM ./Qwen3VL-2B-Instruct-Q8_0.gguf
```

```bash
ollama create qwen3-vl-2b-instruct-gguf -f Modelfile
ollama run qwen3-vl-2b-instruct-gguf
```

## About Forkjoin.ai

[Forkjoin.ai](https://forkjoin.ai) runs AI models at the edge -- in-browser, on-device, zero cloud cost. These converted models power real-time inference, speech recognition, and natural language capabilities.

All conversions are optimized for edge deployment within browser and mobile memory constraints.

## License

Apache 2.0 (follows upstream model license)
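
## Checking the downloaded files

The llama.cpp command in the Usage section expects two files: the main model and the `mmproj` vision projector. Before running it, it can help to confirm that both are present and look like GGUF files. The sketch below relies only on the fact that a GGUF file starts with the four ASCII bytes `GGUF`; the `is_gguf` helper is a hypothetical name introduced here for illustration, not part of llama.cpp or Ollama, and it assumes both files sit in the current directory.

```bash
#!/usr/bin/env bash

# Hypothetical helper (not part of llama.cpp): succeeds only if the
# given path is a regular file whose first four bytes are "GGUF".
is_gguf() {
  local file="$1"
  [ -f "$file" ] || return 1
  # GGUF files begin with the four-byte ASCII magic "GGUF".
  [ "$(head -c 4 -- "$file" 2>/dev/null)" = "GGUF" ]
}

# Check both files named in the llama.cpp command from the Usage section.
for f in Qwen3VL-2B-Instruct-Q8_0.gguf mmproj-Qwen3VL-2B-Instruct-Q8_0.gguf; do
  if is_gguf "$f"; then
    echo "ok: $f"
  else
    echo "missing or not a GGUF file: $f"
  fi
done
```

This only inspects the file header, so it can catch a missing file or an HTML error page saved under a `.gguf` name, but not a truncated download.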