Qwen3.6-golang
Collection
12 items • Updated
How to use jedisct1/Qwen3.6-35B-go-v2-4bit.mlx with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir Qwen3.6-35B-go-v2-4bit.mlx jedisct1/Qwen3.6-35B-go-v2-4bit.mlx
A Go-focused Qwen3.6-35B-A3B model for Apple Silicon, packaged in MLX.
Use it as a coding assistant for Go projects: generating focused patches, explaining diffs, tightening tests, reading tool outputs, and making small repo-aware edits. It was tested with Swival on file-editing and command-running workflows.
This is the plain 4-bit compatibility variant. It does not include native MTP tensors, so it is the best starting point if your MLX loader does not support MTP sidecars.
jedisct1/Qwen3.6-35B-go-v2-8bit.mlx if you want more precision without native MTP.jedisct1/Qwen3.6-35B-go-v2-bf16.mlx if you want full precision without native MTP.jedisct1/Qwen3.6-35B-go-v2-MTP-4bit.mlx if your runtime supports native MTP and you want the faster MTP path.Requires mlx-lm:
pip install mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("jedisct1/Qwen3.6-35B-go-v2-4bit.mlx")
messages = [
{"role": "system", "content": "You are an expert Go developer."},
{"role": "user", "content": "Generate a focused patch that replaces the manual retry loop in fetchUser() with the shared retry helper."},
]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
response = generate(model, tokenizer, prompt=prompt, max_tokens=500)
print(response)
4-bit
Base model
Qwen/Qwen3.6-35B-A3B