davidlvxin commited on
Commit
4d67f66
·
verified ·
1 Parent(s): 120edc2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -44,7 +44,7 @@ We're introducing GLM-5.2, our latest flagship model for long-horizon tasks. It
44
  |Reasoning|||||||||||
45
  |HLE|40.5|31|41.4|37|37.7|49.8*|41.4*|45|
46
  |HLE (w/ Tools)|54.7|52.3|53.5|-|48.2|57.9*|52.2*|51.4*|
47
- |CritPt|16.7|4.6|13.4|3.7|12.9|20.9|27.1|17.7|
48
  |AIME 2026|99.2|95.3|97|-|94.6|95.7|98.3|98.2|
49
  |HMMT Nov. 2025|94.4|94|95|84.4|94.4|96.5|96.5|94.8|
50
  |HMMT Feb. 2026|92.5|82.6|97.1|84.4|95.2|96.7|96.7|87.3|
 
44
  |Reasoning|||||||||||
45
  |HLE|40.5|31|41.4|37|37.7|49.8*|41.4*|45|
46
  |HLE (w/ Tools)|54.7|52.3|53.5|-|48.2|57.9*|52.2*|51.4*|
47
+ |CritPt|20.9|4.6|13.4|3.7|12.9|20.9|27.1|17.7|
48
  |AIME 2026|99.2|95.3|97|-|94.6|95.7|98.3|98.2|
49
  |HMMT Nov. 2025|94.4|94|95|84.4|94.4|96.5|96.5|94.8|
50
  |HMMT Feb. 2026|92.5|82.6|97.1|84.4|95.2|96.7|96.7|87.3|