The same seems to persist in Codex CLI, where again 5.2 doesn't spend as much time thinking so its solutions never come out as nicely as 5.1's.
That said, 5.1 is obviously slower for these reasons. I'm fine with that trade off. Others might have lighter workloads and thus benefit more from 5.2's speed.
It boggles the mind that "wrong answers only" is no longer just a meme, it's considered a valid cost management strategy in AI.
* Because if they realize we're out here, they'll price discriminate, charging extra for right answers.