0 points
tikotus
3mo ago
Here's someone else testing models on a daily logic puzzle (Clues by Sam):
https://www.nicksypteras.com/blog/cbs-benchmark.html
GPT 5 Pro had already come out on top in that test.
thanhhaimai
3mo ago
This link doesn't have Gemini 3 performance on it. Do you have an updated link with the new models?
dezgeg
3mo ago
I've also tried Gemini 3 on Clues by Sam and it does really well. I haven't seen it make a single mistake, even on the Hard and Tricky ones. I haven't run it on many puzzles, though.
crapple8430
3mo ago
GPT 5 Pro is a good 10x more expensive, so it's an apples-to-oranges comparison.