Inference Providers
Active filters: cuda
prism-ml/bonsai-image-ternary-4B-gemlite-2bit
Text-to-Image
• Updated • 41
• 98
prism-ml/bonsai-image-binary-4B-gemlite-1bit
Text-to-Image
• Updated • 16
• 36
Text Generation
• 8B • Updated • 59.9k
• 704
ussoewwin/Flash-Attention-2_for_Windows
Updated • 106
Text Generation
• 4B • Updated • 6.61k
• 46
ESpeech/milfer_denoiser_v1.0
Audio-to-Audio
• Updated • 2
Text Generation
• Updated • 2
EvanOLeary/laguna-xs2-dense-k8-cuda-grpo
Text Generation
• 3B • Updated • 105
• 1
EvanOLeary/laguna-xs2-dense-k8-cuda-dpo
Text Generation
• 3B • Updated • 42
• 1
AngelWarmSmile123/Dania-Cute-Whisper123
Text Generation
• Updated • 1
Text Generation
• Updated • 10
• 23
CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA
Text Generation
• Updated • 5
marcorez8/llama-cpp-python-windows-blackwell-cuda
ValiantLabs/Qwen3-8B-ShiningValiant3
Text Generation
• 8B • Updated • 28
• 3
mradermacher/Qwen3-8B-ShiningValiant3-GGUF
8B • Updated • 2.23k
• 2
mradermacher/Qwen3-8B-ShiningValiant3-i1-GGUF
8B • Updated • 649
• 2
ValiantLabs/Qwen3-1.7B-ShiningValiant3
Text Generation
• 2B • Updated • 10
• • 5
mradermacher/Qwen3-1.7B-ShiningValiant3-GGUF
2B • Updated • 162
mradermacher/Qwen3-1.7B-ShiningValiant3-i1-GGUF
2B • Updated • 436
ValiantLabs/Qwen3-4B-ShiningValiant3
Text Generation
• 4B • Updated • 46
• • 7
sequelbox/Qwen3-8B-PlumEsper
Text Generation
• 8B • Updated • 6
sequelbox/Qwen3-4B-PlumEsper
Text Generation
• 4B • Updated • 5
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-GGUF
3B • Updated • 407
• 2
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-GGUF
2B • Updated • 135
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-GGUF
2B • Updated • 171
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-i1-GGUF
2B • Updated • 347
• 1
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-i1-GGUF
2B • Updated • 268
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-i1-GGUF
3B • Updated • 320
• 1
mradermacher/Qwen3-Shining-Valiant-Instruct-Fast-CODER-Reasoning-2.4B-GGUF
2B • Updated • 186