Image-to-Text
PEFT
Safetensors
Vietnamese
vision-language
qwen
vlm
lora
adapter