OpenAI's GDPval: Why the 66% in Automated Grading Matters More Than 48% Win Rate | Better HN
(medium.com)
7 points
pdasika
7mo ago
2 comments
adisv
7mo ago
Very comprehensive writeup @pdasika. Incredibly relevant for devs working on agentic applications for the enterprise.
kanodiaashu
7mo ago
Interesting take.