Gammaception commited on
Commit
12ae947
·
verified ·
1 Parent(s): e733a92

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -7,7 +7,7 @@ Optimized llama.cpp GGUF quants of Qwen 3.5 27b made for 16gb VRAM cards.
7
 
8
  Made using [GGUF-Tool-suite!](https://github.com/Thireus/GGUF-Tool-Suite/) by Thireus, tuned by me :) (Gammaception)
9
 
10
- Best 16 gb config for GC IQ3_M: (headless 80k ctx kv-cache@(q8_0,q8_0), -ub 256, q8_0 mmproj). Following official sampling parameters recommended, along with reasoning budget + message
11
 
12
 
13
  EDIT 24/03/26: Updated GGUF to fix coding quality, new charts
 
7
 
8
  Made using [GGUF-Tool-suite!](https://github.com/Thireus/GGUF-Tool-Suite/) by Thireus, tuned by me :) (Gammaception)
9
 
10
+ Best 16 gb config for GC IQ3_M: (headless 64k ctx kv-cache@(q8_0,q8_0), -ub 256, q8_0 mmproj). Following official sampling parameters recommended, along with reasoning budget + message
11
 
12
 
13
  EDIT 24/03/26: Updated GGUF to fix coding quality, new charts