That was the biggest surprise to me -- they are really, appreciably worse! Why?
One thing that comes to mind is that AI labs are increasingly specializing models for coding and, to a lesser degree, white-collar work in general (writing summaries, reports, etc.), and maybe that comes at the cost of other, unrelated capabilities.