Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
Evaluating Multimodal LLMs Using the Google IO 2024 Puzzle | Better HN
Evaluating Multimodal LLMs Using the Google IO 2024 Puzzle
(opens in new tab)
(hiresynth.ai)
3 points
simonbutt
2y ago
2 comments
Share
2 comments
default
newest
oldest
malet
2y ago
Surprising to see these models stumbling on what at first glance seems like a simple task, it would be interesting to see how the non-vision models fare if you convert the problems to ascii art
simonbutt
OP
2y ago
GPT-4V, Claude 3 Opus and Gemini Ultra go head to head in solving GoogleIO Puzzle 2024
j
/
k
navigate · click thread line to collapse