---
library_name: mlx
base_model: qingy2024/Qwen3-VLTO-8B-Instruct
pipeline_tag: text-generation
tags:
- mlx
---

# CallMcMargin/Qwen3-VLTO-8B-Instruct-mlx-bf16-affine-qgroup32-q8

This model [CallMcMargin/Qwen3-VLTO-8B-Instruct-mlx-bf16-affine-qgroup32-q8](https://huggingface.co/CallMcMargin/Qwen3-VLTO-8B-Instruct-mlx-bf16-affine-qgroup32-q8) was
converted to MLX format from [qingy2024/Qwen3-VLTO-8B-Instruct](https://huggingface.co/qingy2024/Qwen3-VLTO-8B-Instruct)
using mlx-lm version **0.28.3**.

## Use with mlx

```bash
pip install mlx-lm
```

```python
from mlx_lm import load, generate

model, tokenizer = load("CallMcMargin/Qwen3-VLTO-8B-Instruct-mlx-bf16-affine-qgroup32-q8")

prompt = "hello"

if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)
```
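
The repository name suggests 8-bit affine quantization with a group size of 32, although this card does not state the conversion settings, so treat that reading as an assumption. Under it, and assuming each group of weights stores one 16-bit scale and one 16-bit bias alongside the quantized values, the storage cost per quantized weight can be sketched as follows. The helper `affine_bits_per_weight` is hypothetical and not part of mlx-lm.

```python
def affine_bits_per_weight(bits: int, group_size: int, param_bits: int = 16) -> float:
    """Effective storage per weight for group-wise affine quantization.

    Assumes every group of `group_size` weights shares one scale and one
    bias, each stored in `param_bits` bits (an assumption, not read from
    the model files).
    """
    return bits + 2 * param_bits / group_size


# Assumed from the repository name: "q8" -> 8 bits, "qgroup32" -> group size 32.
bpw = affine_bits_per_weight(bits=8, group_size=32)
print(f"{bpw:.2f} bits per weight")
```

This figure covers only the layers that are actually quantized; any layers kept in bf16 still take 16 bits per weight.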