Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
HWE Bench: A new unbounded Benchmark for LLMs (GPT 5.5 is on top) | Better HN
HWE Bench: A new unbounded Benchmark for LLMs (GPT 5.5 is on top)
(opens in new tab)
(hwebench.com)
6 points
fesens
11d ago
3 comments
Share
3 comments
default
newest
oldest
fesens
OP
11d ago
Current benchmarks have ceilings, usually 100%. This benchmark aims to be a long lasting, high correlation with the ability to solve real world problems and follow complex instructions, and unbounded (meaning it can always go higher).
paulobeckhauser
10d ago
Very nice!!
fabiofachini92
11d ago
Amazing!
j
/
k
navigate · click thread line to collapse