RossAscends committed on
Commit e96d6e7 · verified · 1 Parent(s): a94af04

Update README.md

Files changed (1)
  1. README.md +5 -5
README.md CHANGED
@@ -9,12 +9,12 @@ GGUF quants of: https://huggingface.co/grimjim/gemma-3-12b-it-norm-preserved-bip
  ---

  Ctx limits per quant (RTX 3060 12GB, F16 k/v, no offload):
- |quant|ctx|
- |---|---|
+ |quant|ctx|comment|
+ |---|---|---|
  |Q2_K_S|16k|
  |iQ3_S|15k|
  |Q3_K_S|15k|
  |iQ4_XS|12k|
- |iQ4_NL|10k (16k with q8_0 k/v)|
- |Q4_K_S|10k ( " )|
- |Q5_K_S|8k (15k with q8_0 k/v)|
+ |iQ4_NL|10k|(16k with q8_0 k/v)|
+ |Q4_K_S|10k|( " )|
+ |Q5_K_S|8k|(14k with q8_0 k/v)|