
arrowsmith 1mo ago

What about 5.1 do you prefer over 5.2?


As far as I can tell 5.2 is the stronger model on paper, but it's been optimized to think less and do fewer web searches. I daily drive Thinking variants, not Auto or Instant, and usually want the _right_ answer even if it takes a minute. 5.1 does a very good job of defensively web searching, which avoids almost all of its hallucinations and keeps docs/APIs/UIs/etc. up to date. 5.2 will instead often not think at all, even in Thinking mode. I've gotten several completely wrong, hallucinated answers since 5.2 came out, whereas I've had maybe a handful from 5.1. (Even with me using 5.2 far less!)

The same seems to persist in Codex CLI, where again 5.2 doesn't spend as much time thinking, so its solutions never come out as nicely as 5.1's.

That said, 5.1 is obviously slower for these reasons. I'm fine with that trade-off. Others might have lighter workloads and thus benefit more from 5.2's speed.

Terretta 1mo ago

This is a terrible thing to say out loud*, but, in all such cases I'd rather just give them the more money to do the better answers.

It boggles the mind that "wrong answers only" is no longer just a meme, it's considered a valid cost management strategy in AI.

* Because if they realize we're out here, they'll price discriminate, charging extra for right answers.
