GGUF
llama.cpp
unsloth
qwen3.6
conversational
How to use from
Hermes Agent
Start the llama.cpp server
# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf TeichAI/Qwen3.6-27B-Claude-Opus-Reasoning-Distill-v2-GGUF:
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default TeichAI/Qwen3.6-27B-Claude-Opus-Reasoning-Distill-v2-GGUF:
Run Hermes
hermes
Quick Links

Qwen3.6 27B x Claude Opus 4.x - v2

Benchmarks

alt_text

Qwen3.6-27B-Claude-Opus-Reasoning-Distill-v2
         arc   arc/e boolq hswag obkqa piqa  wino
mxfp8    0.665,0.831,0.910,0.790,0.456,0.813,0.772

Qwen3.6-27B
         arc   arc/e boolq hswag obkqa piqa  wino
mxfp8    0.647,0.803,0.910,0.773,0.450,0.806,0.742

Provided by @nightmedia. All benchmarks were done in mxfp8 precision

🧬 Datasets:

⚡ Use cases

  • Coding
  • Creative Writing
  • Visual Understanding
  • General Purpose

Citations and Contributions

  • @unsloth - This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
  • @Qwen - Providing a fantastic, native-multimodal base model

Usage

If you need help setting up and configuring this model please follow the Qwen team's instructions in the original model's README

Downloads last month
11,404
GGUF
Model size
27B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for TeichAI/Qwen3.6-27B-Claude-Opus-Reasoning-Distill-v2-GGUF

Base model

Qwen/Qwen3.6-27B
Quantized
(10)
this model

Datasets used to train TeichAI/Qwen3.6-27B-Claude-Opus-Reasoning-Distill-v2-GGUF