GRM-2.6-Plus / .eval_results /swe_bench_verified.yaml
DedeProGames's picture
Fix benchmark
3f5fcc3 verified
raw
history blame contribute delete
241 Bytes
- dataset:
id: SWE-bench/SWE-bench_Verified
task_id: swe_bench_%_resolved
value: 77.7
date: '2026-04-23'
source:
url: https://huggingface.co/OrionLLM/GRM-2.6-Plus
name: Official GRM-2.6 Benchmark
user: DedeProGames