Text-to-Image
Diffusers
Safetensors
qwen2_5_vl
lora
template:diffusion-lora
4-bit precision
bitsandbytes
Instructions to use Salmankotakuth/Arabic-OCR-Qwen2.5-VL-7B-Instruct-bnb-4bit-lora-merged with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use Salmankotakuth/Arabic-OCR-Qwen2.5-VL-7B-Instruct-bnb-4bit-lora-merged with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("fill-in-base-model", dtype=torch.bfloat16, device_map="cuda") pipe.load_lora_weights("Salmankotakuth/Arabic-OCR-Qwen2.5-VL-7B-Instruct-bnb-4bit-lora-merged") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- Draw Things
- DiffusionBee
Qwen-lora-merged-4bit-model
Model description
The Qwen LLM can be finetuned with LoRA adapters by freezing the original model weights and only training a small number of new, low-rank matrices. The original, large model is left untouched, and the small adapter weights are trained to capture the information for a new, specific task. This makes the process much more efficient and less computationally expensive than traditional fine-tuning. After the finetuning, we can merge the Qwen model and the LoRa adapters to form a single model, which can be run using vLLM or Ollama.
Download model
Download them in the Files & versions tab.
- Downloads last month
- 7