The difference between serving 1 and 1 billion http queries is not the same as the difference between generating 1 and 1 billion tokens.
The startup blitz-scaling-market-capturing playbook makes makes sense when you spend to scale, not when you spend because you scale, yeah, I understand that step 2 is "and now you squeeze the users", but it will need to be by such a bigger factor...