Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

752

Base only

Active filters: rlvr

lxazjk/qwen2.5-1.5b-24game-grpo

Text Generation • 2B • Updated 17 days ago • 308 • 1

artichoke42/Qwen3.6-27B-KR-MTP-GGUF

Text Generation • 3.05M • Updated 12 days ago • 555 • 1

beyoru/seul-preview

Text Generation • 9B • Updated about 1 hour ago • 638 • 1

SultanR/SmolTulu-1.7b-Reinforced-GGUF

Text Generation • 2B • Updated Dec 17, 2024 • 7 • 1

thuml/rt1-world-model-multi-step-rlvr

0.1B • Updated May 26, 2025 • 15

thuml/rt1-world-model-single-step-rlvr

0.1B • Updated May 26, 2025 • 15

thuml/webarena-world-model-rlvr

2B • Updated May 26, 2025 • 12

thuml/bytesized32-world-model-rlvr-binary-reward

2B • Updated May 26, 2025 • 16

thuml/bytesized32-world-model-rlvr-task-specific-reward

2B • Updated May 26, 2025 • 14

DebateLabKIT/Llama-3.1-Argunaut-1-8B-HIRPO

Text Generation • 8B • Updated Jul 24, 2025 • 13 • 1

thinkwee/NOVER1-Qwen3-4B

Question Answering • 4B • Updated Aug 20, 2025 • 3 • 2

thinkwee/NOVER1-Qwen2.5-7B

Question Answering • 8B • Updated Aug 20, 2025 • 2 • 2

mradermacher/NOVER1-Qwen3-4B-GGUF

4B • Updated Aug 21, 2025 • 118 • 1

mradermacher/NOVER1-Qwen2.5-7B-GGUF

8B • Updated Aug 21, 2025 • 218 • 1

mradermacher/NOVER1-Qwen3-4B-i1-GGUF

4B • Updated Dec 16, 2025 • 115 • 1

mradermacher/NOVER1-Qwen2.5-7B-i1-GGUF

8B • Updated Dec 16, 2025 • 623 • 1

DebateLabKIT/Phi-4-Argunaut-1-HIRPO

Text Generation • 415k • Updated Dec 2, 2025 • 12

mradermacher/Llama-3.1-Argunaut-1-8B-HIRPO-GGUF

8B • Updated Sep 18, 2025 • 125 • 1

mradermacher/Llama-3.1-Argunaut-1-8B-HIRPO-i1-GGUF

8B • Updated Dec 10, 2025 • 107 • 1

fangwu97/DeepSearch-1.5B

Text Generation • 2B • Updated Oct 20, 2025 • 32 • 9

ziadrone/airesupdated-v2

Text Generation • 4B • Updated Oct 28, 2025 • 6 • 1

mradermacher/airesupdated-v2-GGUF

Reinforcement Learning • 4B • Updated Oct 24, 2025 • 53

ABaroian/Apertus-8B-RLVR-GSM

Text Generation • Updated Dec 3, 2025 • 2

Anonymouslolol/qwen3-8B-hanabi-step110

Reinforcement Learning • Updated Oct 24, 2025 • 1

beyoru/MaxCoder-4B

Text Generation • 4B • Updated Nov 7, 2025 • 1

anonymousatom/IntelliAsk-Qwen3-32B-450-Merged

Text Generation • 33B • Updated Mar 9 • 7

mradermacher/IntelliAsk-Qwen3-32B-450-Merged-GGUF

Reinforcement Learning • 33B • Updated Apr 8 • 167

mradermacher/Phi-4-Argunaut-1-HIRPO-GGUF

15B • Updated Dec 3, 2025 • 30

mradermacher/Phi-4-Argunaut-1-HIRPO-i1-GGUF

15B • Updated Dec 4, 2025 • 398

pankajmathur/RenCoder-Devstral-Small-2507

Text Generation • 24B • Updated Apr 10 • 17 • 1