Image-Text-to-Text
Transformers
GGUF
English
conversational
langdaohlb commited on
Commit
da058f7
·
verified ·
1 Parent(s): e6e5075

Add ZwZ-2B-GGUF model weights

Browse files
.gitattributes CHANGED
@@ -33,3 +33,9 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ ZwZ-2B-F16.gguf filter=lfs diff=lfs merge=lfs -text
37
+ ZwZ-2B-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
38
+ ZwZ-2B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
39
+ mmproj-ZwZ-2B-F16.gguf filter=lfs diff=lfs merge=lfs -text
40
+ mmproj-ZwZ-2B-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
41
+ mmproj-ZwZ-2B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
REDME.md ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ **ZwZ-2B-GGUF**
2
+
3
+ This repository provides GGUF-format weights for [ZwZ-2B](https://huggingface.co/inclusionAI/ZwZ-2B), split into two components:
4
+
5
+ - Language model (LLM): FP16, Q8_0, Q4_K_M
6
+ - Vision encoder (mmproj): FP16, Q8_0, Q4_K_M
7
+
8
+ These files are compatible with llama.cpp, Ollama, and other GGUF-based tools, supporting inference on CPU, NVIDIA GPU (CUDA), Apple Silicon (Metal), Intel GPUs (SYCL), and more. You can mix precision levels for the language and vision components based on your hardware and performance needs, and even perform custom quantization starting from the FP16 weights.
9
+
10
+ Enjoy running this multimodal model on your personal device! 🚀
ZwZ-2B-F16.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2aa2db0e35a67a37e4c695a9310ff3d53e0dbbfe2082e411a6a5561e32301434
3
+ size 4069680064
ZwZ-2B-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e6db5b045a184200e2bcf2bb875921fdfdc31044110974112b484b5726115df7
3
+ size 1282440128
ZwZ-2B-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:335b075e96966003832826fb17dffb17b179af9c9efcdbb318fe6578675d35e9
3
+ size 2165040064
mmproj-ZwZ-2B-F16.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aac8c055e7c641c590530fa15b4c576d491dd0f3af0e66790250e1244d36663f
3
+ size 819394688
mmproj-ZwZ-2B-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3a63f1091267b600401fd4d815c504d2889741bd335d6cc8bfba17566e24c27b
3
+ size 275970176
mmproj-ZwZ-2B-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9037f53b860bd4dad00833c90e43bbd03ee5dd14954e52a00f09a174075088f9
3
+ size 441907328