RemySkye commited on
Commit
ec910dd
·
verified ·
1 Parent(s): 26da3bd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -1
README.md CHANGED
@@ -18,7 +18,7 @@ tags:
18
 
19
  # MiniCPM5-1B-i1-GGUF
20
 
21
- This repository contains GGUF format quantizations of the base model openbmb/MiniCPM5-1B. These models were generated using llama.cpp with an Importance Matrix (imatrix) - https://gist.github.com/bartowski1182/82ae9b520227f57d79ba04add13d0d0d
22
 
23
  ## Models Available
24
 
@@ -26,3 +26,6 @@ The collection includes a total of 26 files:
26
  * Baseline Formats: BF16, FP16, and FP32
27
  * Quantization Types: 23 different levels including IQ1_S, IQ2_XXS, Q4_K_M, and Q8_0
28
  * Importance Matrix: `imatrix.gguf` is included for reference or custom use
 
 
 
 
18
 
19
  # MiniCPM5-1B-i1-GGUF
20
 
21
+ This repository contains GGUF format quantizations of the model openbmb/MiniCPM5-1B. These models were generated using llama.cpp with an Importance Matrix (imatrix) - https://gist.github.com/bartowski1182/82ae9b520227f57d79ba04add13d0d0d
22
 
23
  ## Models Available
24
 
 
26
  * Baseline Formats: BF16, FP16, and FP32
27
  * Quantization Types: 23 different levels including IQ1_S, IQ2_XXS, Q4_K_M, and Q8_0
28
  * Importance Matrix: `imatrix.gguf` is included for reference or custom use
29
+
30
+ # NOTE
31
+ Quantization below 4 bits may result in unacceptably low quality, particularly for a 1B parameter model, which is inherently more sensitive to such compression.