File size: 422 Bytes
bb0649e
 
 
 
 
 
e91c6af
012f70d
 
 
e91c6af
a94af04
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
license: mit
base_model:
- grimjim/gemma-3-12b-it-norm-preserved-biprojected-abliterated
---

GGUF quants of: https://huggingface.co/grimjim/gemma-3-12b-it-norm-preserved-biprojected-abliterated

---

Ctx limits per quant (RTX 3060 12GB, F16 k/v, no offload):
|quant|ctx|
|---|---|
|Q2_K_S|16k|
|iQ3_S|15k|
|Q3_K_S|15k|
|iQ4_XS|12k|
|iQ4_NL|10k (16k with q8_0 k/v)|
|Q4_K_S|10k ( " )|
|Q5_K_S|8k  (15k with q8_0 k/v)|