NewsaHackO
4mo ago
>establish benchmarks that make sense and are reliable
In what way are current LLM coding benchmarks unreliable?
Papazsazsa
4mo ago
They're manipulated.
NewsaHackO
OP
4mo ago
Unless you are going to be more specific, that criticism applies to all benchmarks that are connected to a positive gain, not just AI coding benchmarks.