- dataset: id: TIGER-Lab/MMLU-Pro task_id: mmlu_pro value: 80.1 source: url: https://huggingface.co/OrionLLM/GRM-2.5/ name: Official GRM-2.5 Benchmark user: DedeProGames