Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

9

Base only

Active filters: reinforcement

iso-ai1/isopro

Updated Nov 1, 2024

leorc/Simulus

Reinforcement Learning • Updated Feb 21, 2025 • 1

Daemontatox/Zireal-0

Text Generation • 684B • Updated Jul 1, 2025 • 59 • 1

mradermacher/Zireal-0-GGUF

Updated Jul 31, 2025 • 1

SakanaAI/RLT-7B

Text Generation • 8B • Updated Jun 22, 2025 • 13 • • 19

SakanaAI/RLT-32B

Text Generation • 33B • Updated Jun 22, 2025 • 7 • 7

StephenGenusa/RLT-32B-Q5_0-GGUF

Text Generation • 33B • Updated Jun 24, 2025 • 6

Sam2x/SakanaAI-RLT-7B-GGUF

Text Generation • 8B • Updated Jun 30, 2025 • 66 • 1

wahidmounir/RLT-32B-Q4_K_M-GGUF

Text Generation • 33B • Updated Mar 1 • 5