Instruction-Following Evaluation for Large Language Models Paper • 2311.07911 • Published Nov 14, 2023 • 22
vectara/hallucination_evaluation_model Text Classification • 0.1B • Updated Oct 20, 2025 • 83.7k • 355
Runtime error Agents Featured 437 Open Medical-LLM Leaderboard 🥇 437 Explore and submit models for benchmarking
Running on CPU Upgrade Agents 76 La Leaderboard 🌸 76 Evaluate open LLMs in the languages of LATAM and Spain.