How to use from
Docker Model Runner
docker model run hf.co/CoNDeNse-AI/Claude4.6-Qwen3-1.7B-CoNDeNse-GGUF:
Quick Links

Claude4.6-Qwen3-1.7B-CoNDeNse-GGUF

GGUF quantizations of Claude4.6-Qwen3-1.7B-CoNDeNse by CoNDeNse-AI


Available Quantizations

Quant File Type Recommended Use
Q2_K Ultra-small Maximum speed / very low RAM
Q3_K_M Lightweight Good quality-to-size ratio
Q4_K_M Balanced Best general-use quant
Q5_K_M High quality Strong reasoning retention
Q6_K Near-lossless Best quality while staying quantized

Model Details

  • Base Model: Qwen3-1.7B

  • Architecture: Qwen3

  • Fine-tuned by: CoNDeNse-AI

  • Format: GGUF

  • Compatibility:

    • LM Studio
    • llama.cpp
    • Ollama
    • KoboldCpp
    • Jan
    • Text Generation WebUI

Quantization Overview

Q2_K

Optimized for extremely low memory usage and fast inference on weak hardware.

Q3_K_M

Recommended lightweight daily-driver quant with solid quality retention.

Q4_K_M

Best balance between reasoning quality, speed, and size.

Q5_K_M

High-quality quant with noticeably stronger reasoning and response consistency.

Q6_K

Near-full precision experience with excellent output quality while remaining efficient.


Example llama.cpp Usage

./llama-cli \
  -m Claude4.6-Qwen3-1.7B-Q6_K.gguf \
  -p "Explain quantum tunneling like I'm 12."

Recommended Settings

Setting Value
Temperature 0.6 - 0.8
Top-p 0.9
Context Length 8k+
Repeat Penalty 1.05

Notes

This model is optimized for:

  • reasoning
  • search-aware responses
  • compact inference
  • edge deployment
  • efficient local usage

Performance may vary depending on backend, prompt format, and quantization level.


Credits


License

Please follow the original Qwen license and usage terms.

Downloads last month
682
GGUF
Model size
2B params
Architecture
qwen3
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for CoNDeNse-AI/Claude4.6-Qwen3-1.7B-CoNDeNse-GGUF

Finetuned
Qwen/Qwen3-1.7B
Quantized
(281)
this model

Collection including CoNDeNse-AI/Claude4.6-Qwen3-1.7B-CoNDeNse-GGUF