Inference Providers
Active filters: nvidia
nvidia/DeepSeek-V4-Pro-NVFP4
Text Generation
• 910B • Updated • 81k
• 62
cHunter789/Qwen3.6-27B-i1-IQ4_KS-GGUF
Text Generation
• 27B • Updated • 5.43k
• 18
AEON-7/Gemma-4-12B-it-AEON-Abliterated-K4-BF16
Text Generation
• 12B • Updated • 2.62k
• 25
r0b0tlab/nex-n2-mini-nvfp4
Text Generation
• 18B • Updated • 1.5k
• 6
nvidia/NVIDIA-Nemotron-Nano-9B-v2
Text Generation
• 9B • Updated • 448k
• 495
nvidia/Nemotron-Labs-Diffusion-3B
Text Generation
• 4B • Updated • 36.2k
• 31
nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16
Text Generation
• 4B • Updated • 798k
• 93
nvidia/Nemotron-Cascade-2-30B-A3B
Text Generation
• 32B • Updated • 30k
• 505
nvidia/Kimodo-SOMA-RP-v1.1
0.3B • Updated • 1.73k
• 25
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8
Any-to-Any
• 33B • Updated • 35.1k
• 54
mlx-community/LocateAnything-3B-8bit
Image-Text-to-Text
• 1B • Updated • 404
• 4
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-Base-BF16
Text Generation
• 561B • Updated • 2.22k
• 25
nvidia/nemotron-3-8b-base-4k
Text Generation
• Updated • 1
• 105
nvidia/OpenMath-CodeLlama-70b-Python-hf
Text Generation
• 69B • Updated • 23
• 12
nvidia/Llama-3.1-Minitron-4B-Width-Base
Text Generation
• 5B • Updated • 1.6k
• 195
nvidia/Nemotron-Mini-4B-Instruct
Text Generation
• Updated • 292k
• 183
bartowski/Open-Insurance-LLM-Llama3-8B-GGUF
Text Generation
• 8B • Updated • 352
• 6
nvidia/Cosmos-1.0-Guardrail
Updated • 2.84k
• 62
nvidia/Cosmos-Transfer1-7B
Updated • 357
• 65
bartowski/nvidia_Llama-3.1-Nemotron-Nano-8B-v1-GGUF
Text Generation
• 8B • Updated • 1.08k
• 11
Image-Text-to-Text
• 8B • Updated • 22.2k
• 242
nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1
Text Generation
• 5B • Updated • 8.72k
• 115
nvidia/AceReason-Nemotron-7B
Text Generation
• 8B • Updated • 4.16k
• • 22
nvidia/Cosmos-Embed1-448p
1B • Updated • 14.1k
• 12
nvidia/Cosmos-Predict2-2B-Sample-Action-Conditioned
Updated • 37
• 10
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
• 16B • Updated • 42k
• 34
nvidia/Cosmos-Transfer2.5-2B
Updated • 7.65k
• 66
nvidia/Cosmos-Predict2.5-2B
Updated • 61.4k
• 134
cyankiwi/Llama-3_3-Nemotron-Super-49B-v1_5-AWQ-4bit
Text Generation
• 8B • Updated • 202
• 4
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation
• 16B • Updated • 5.37k
• 33