Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
0 points
grantpitt
4mo ago
0 comments
Share
Agreed, it also leads performance on arc-agi-1. Here's the leaderboard where you can toggle between arc-agi-1 and 2:
https://arcprize.org/leaderboard
undefined | Better HN
0 comments
default
newest
oldest
energy123
4mo ago
It leads on arc-agi-1 with Gemini 3.0 Deep Think, which uses "tool calls" according to google's post, whereas regular Gemini 3.0 Pro doesn't use "tool calls" for the same benchmark. I am unsure how significant this difference is.
j
/
k
navigate · click thread line to collapse