How to use from Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull inference-snaps/NVIDIA-Nemotron-3-Super-120B-A12B-UD-Q4_K_M-5GB:UD-Q4_K_M
Run and chat with the model
lemonade run user.NVIDIA-Nemotron-3-Super-120B-A12B-UD-Q4_K_M-5GB-UD-Q4_K_M
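The name passed to `lemonade run` differs from the reference passed to `lemonade pull`. Comparing the two commands above, the local name appears to drop the `inference-snaps/` prefix, replace the `:` before the quantization tag with `-`, and prepend `user.`. The sketch below encodes that mapping in a small helper. Both the helper name `lemonade_run_name` and the mapping rule itself are assumptions inferred from this one example, not documented Lemonade behavior, so check the result against `lemonade list` before relying on it.

```shell
# Hypothetical helper: derive the local run name from a pull reference.
# Assumption: the rule (drop "org/", turn ":" into "-", prepend "user.")
# is inferred from the two commands shown above, not from Lemonade docs.
lemonade_run_name() {
  ref="$1"
  name="${ref#*/}"                          # strip everything up to the first "/"
  name=$(printf '%s' "$name" | tr ':' '-')  # replace the tag separator with "-"
  printf 'user.%s\n' "$name"
}

lemonade_run_name "inference-snaps/NVIDIA-Nemotron-3-Super-120B-A12B-UD-Q4_K_M-5GB:UD-Q4_K_M"
```

For the reference used in this section, the helper prints the same name that the run command above uses.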
List all available models
lemonade list
GGUF
Model size: 121B params
Architecture: nemotron_h_moe
Quantization: 4-bit
