RossAscends commited on
Commit
a94af04
·
verified ·
1 Parent(s): 3d8c042

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -7
README.md CHANGED
@@ -9,10 +9,12 @@ GGUF quants of: https://huggingface.co/grimjim/gemma-3-12b-it-norm-preserved-bip
9
  ---
10
 
11
  Ctx limits per quant (RTX 3060 12GB, F16 k/v, no offload):
12
- - Q2_K_S = 16k
13
- - iQ3_S = 15k
14
- - Q3_K_S = 15k
15
- - iQ4_XS = 12k
16
- - iQ4_NL = 10k (16k with q8_0 k/v)
17
- - Q4_K_S = 10k ( " )
18
- - Q5_K_S = 8k (15k with q8_0 k/v)
 
 
 
9
  ---
10
 
11
  Ctx limits per quant (RTX 3060 12GB, F16 k/v, no offload):
12
+ |quant|ctx|
13
+ |---|---|
14
+ |Q2_K_S|16k|
15
+ |iQ3_S|15k|
16
+ |Q3_K_S|15k|
17
+ |iQ4_XS|12k|
18
+ |iQ4_NL|10k (16k with q8_0 k/v)|
19
+ |Q4_K_S|10k ( " )|
20
+ |Q5_K_S|8k (15k with q8_0 k/v)|