99.99% of people cannot run these models on their own hardware, they are forced to rent it from someone. That someone is almost always the big China players themselves anyways.
Why else is Qwen now having cloud-only models?
Model - Deepseek V4 Pro
CHEAPEST PROVIDER: Provider: Deepseek Input Price - $0.435/M tokens Output Price - $0.87/M tokens Cache Read - $0.003625/M tokens
SECOND CHEAPEST: Provider: deepinfra Input Price - $1.30/M tokens Output Price - $2.60/M tokens Cache Read - $0.10/M tokens
Deepinfra is almost 3x more expensive and they are using a fp4 model, with Max 16.4K output (vs 364K) and have significantly lower throughput!
I mean FFS a single hyper scale datacenter can provide free school lunches for a year. Something tells me the economic output of making sure children are fed is way higher than whether Zuckerberg can own another Hawaiian island by allowing people to be scammed by LLMs.
I’m an American person yet I’m not public property.