How to use from
Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull ggml-org/granite-4.0-h-small-Q8_0-GGUF:Q8_0
Run and chat with the model
lemonade run user.granite-4.0-h-small-Q8_0-GGUF-Q8_0
List all available models
lemonade list
Quick Links

ggml-org/granite-4.0-h-small-Q8_0-GGUF

This model was converted to GGUF format from ibm-granite/granite-4.0-h-small using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

Downloads last month
44
GGUF
Model size
32B params
Architecture
granitehybrid
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ggml-org/granite-4.0-h-small-Q8_0-GGUF

Quantized
(36)
this model