The title seems misleading, and reading the article explains the reason more clearly. There's nonsense OKR's and objectives at these companies to burn as many tokens as possible. It turns out that when you make a metric out of token usage, it unsurprisingly ends up becoming extremely expensive.
Inference is affordable, and you don't need a SOTA proprietary model to get a lot of use out of this technology. While you likely will still need a human engineer for quite a while longer, I don't agree that some number of humans + an LLM is going to be (or will ever remain) more expensive than just hiring more humans.