Granite-4.1-8B β€” Q4_K_M for IBM Power (Linux ppc64le + AIX)

Granite-4.1-8B quantized to Q4_K_M with a Q6_K output head for fast CPU inference on IBM Power β€” POWER9 (VSX) and POWER10/11 (MMA-accelerated) via LibrePower. No GPU required. Size: 5.0G.

Run it

Ubuntu / Debian ppc64le:

curl -fsSL https://linux.librepower.org/install.sh | sudo sh
sudo apt install librepower-llama
wget https://huggingface.co/librepowerai/Granite-4.1-8B-Power/resolve/main/Granite-4.1-8B-Q4_K_M.gguf
lp-llama-completion -m Granite-4.1-8B-Q4_K_M.gguf -p "Hello!" -n 64 -t $(nproc)

IBM AIX 7.3 (big-endian):

dnf install llama-aix
wget https://huggingface.co/librepowerai/Granite-4.1-8B-Power/resolve/main/Granite-4.1-8B-Q4_K_M-be.gguf
lp-llama-completion -m Granite-4.1-8B-Q4_K_M-be.gguf -p "Hello!" -n 64 -t $(nproc)

Files

  • Granite-4.1-8B-Q4_K_M.gguf β€” little-endian (Ubuntu/Linux ppc64le)
  • Granite-4.1-8B-Q4_K_M-be.gguf β€” big-endian (IBM AIX)

Good for

IBM 8B dense, signed, ISO-42001: enterprise RAG, tool-calling, structured JSON, code

Credits

Base model by its original authors (Apache-2.0). Quantization & Power packaging: LibrePower.

Downloads last month
43
GGUF
Model size
9B params
Architecture
granite
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for librepowerai/Granite-4.1-8B-Power

Quantized
(47)
this model