- dataset: id: harborframework/terminal-bench-2.0 task_id: terminalbench_2 value: 59.8 date: '2026-04-23' source: url: https://huggingface.co/OrionLLM/GRM-2.6-Plus name: Official GRM-2.6 Benchmark user: DedeProGames