GRM-2.6-Plus / .eval_results /mmlu_pro.yaml
Update .eval_results/mmlu_pro.yaml (commit eaf06a8, verified)
- dataset:
    id: TIGER-Lab/MMLU-Pro
    task_id: mmlu_pro
  value: 86.8
  source:
    url: https://huggingface.co/OrionLLM/GRM-2.6-Plus
    name: Official GRM-2.6 Benchmark
    user: DedeProGames