Add evaluation results (including Claw-eval)

#6
by SaylorTwift HF Staff - opened
Files changed (1) hide show
  1. .eval_results/Ornith-1.0-397B.yaml +26 -0
.eval_results/Ornith-1.0-397B.yaml ADDED
@@ -0,0 +1,26 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ - dataset:
2
+ id: SWE-bench/SWE-bench_Verified
3
+ task_id: swe_bench_%_resolved
4
+ value: 82.4
5
+ date: "2026-06-25"
6
+ source:
7
+ url: https://huggingface.co/deepreinforce-ai/Ornith-1.0-397B
8
+ name: "Ornith-1.0-397B model card"
9
+
10
+ - dataset:
11
+ id: ScaleAI/SWE-bench_Pro
12
+ task_id: SWE_Bench_Pro
13
+ value: 62.2
14
+ date: "2026-06-25"
15
+ source:
16
+ url: https://huggingface.co/deepreinforce-ai/Ornith-1.0-397B
17
+ name: "Ornith-1.0-397B model card"
18
+
19
+ - dataset:
20
+ id: claw-eval/Claw-Eval
21
+ task_id: general
22
+ value: 77.1
23
+ date: "2026-06-25"
24
+ source:
25
+ url: https://huggingface.co/deepreinforce-ai/Ornith-1.0-397B
26
+ name: "Ornith-1.0-397B model card"