How to use from
Unsloth Studio
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for ggml-org/granite-4.0-h-small-Q8_0-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for ggml-org/granite-4.0-h-small-Q8_0-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for ggml-org/granite-4.0-h-small-Q8_0-GGUF to start chatting
Quick Links

ggml-org/granite-4.0-h-small-Q8_0-GGUF

This model was converted to GGUF format from ibm-granite/granite-4.0-h-small using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

Downloads last month
44
GGUF
Model size
32B params
Architecture
granitehybrid
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ggml-org/granite-4.0-h-small-Q8_0-GGUF

Quantized
(36)
this model