undefined | Better HN

0 pointsskywhopper1y ago0 comments

"The fact that scaled reasoning models are finally showing progress on ARC proves that what it measures really is relevant and important for reasoning."

Not sure I understand how this follows. The fact that a certain type of model does well on a certain benchmark means that the benchmark is relevant for a real-world reasoning? That doesn't make sense.

0 comments

munchler1y ago

It shows objectively that the models are getting better at some form of reasoning, which is at least worth noting. Whether that improved reasoning is relevant for the real world is a different question.

1 more reply

bagels1y ago

It doesn't follow, faulty logic. The two are probably correlated though.

j / k navigate · click thread line to collapse

0 comments

munchler1y ago

1 more reply

bagels1y ago

It doesn't follow, faulty logic. The two are probably correlated though.

j / k navigate · click thread line to collapse