Why would Microsoft subsidize Anthropic's models when they serve the Claude model on Azure? They charge the same price as Anthropic. They aren't an investor in Anthropic.
There are numerous independent model serving companies that are clearly profitable serving non-Frontier models (Kimi K2.5 etc). It's easy to work out the raw costs of B200 GPUs, and then see what you need to charge for an API and see they make money.
The frontier labs charge a lot more than these companies.
The frontier labs have said they are profitable on inference.
Most people believe that training (and maybe subscriptions for some users) is where they lose money. Why do you think otherwise?