Naphula committed · Commit ba43cbb · verified · 1 Parent(s): 48fdc28

Update README.md

Files changed (1)
  1. README.md +4 -0
README.md CHANGED
@@ -8,4 +8,8 @@ this is the setup I use with 8gb vram
 
 `koboldcpp.exe --host 0.0.0.0 --port 5001 --model Q3_K_M-GGUF-00001-of-00006.gguf --flashattention --contextsize 57344 --gpulayers 22 --moecpu`
 
+or
+
+`koboldcpp.exe --host 0.0.0.0 --port 5001 --model Q3_K_M-GGUF-00001-of-00006.gguf --flashattention --contextsize 16384 --gpulayers 48 --moecpu`
+
 this model is really good but i recommend mlabonne's abliterated gemma 3 27B for cases when GLM struggles with prompt adherence
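
The two commands in this change differ only in `--contextsize` and `--gpulayers`: one pairs a 57344-token context with 22 GPU layers, the other a 16384-token context with 48 GPU layers. A minimal sketch of keeping both as named presets might look like the following. The flag names and values are copied from the commands above; the preset names and the `build_command` helper are hypothetical and not part of koboldcpp or the README.

```python
# Hypothetical helper: assembles the two koboldcpp command lines shown in the diff.
# Only flags and values that appear in the README are used.
MODEL = "Q3_K_M-GGUF-00001-of-00006.gguf"

# Preset names are illustrative assumptions, not taken from the README.
PRESETS = {
    "long_context": {"contextsize": 57344, "gpulayers": 22},
    "more_gpu_layers": {"contextsize": 16384, "gpulayers": 48},
}


def build_command(preset: str, exe: str = "koboldcpp.exe") -> list[str]:
    """Return the argument list for the chosen preset."""
    opts = PRESETS[preset]
    return [
        exe,
        "--host", "0.0.0.0",
        "--port", "5001",
        "--model", MODEL,
        "--flashattention",
        "--contextsize", str(opts["contextsize"]),
        "--gpulayers", str(opts["gpulayers"]),
        "--moecpu",
    ]


if __name__ == "__main__":
    # Print both variants so they can be compared or pasted into a terminal.
    for name in PRESETS:
        print(f"{name}: {' '.join(build_command(name))}")
```

The returned list can be passed directly to `subprocess.run` if launching from a script is preferred over pasting the printed line.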