See Chart 13 here: https://www.rdworldonline.com/ais-great-compression-20-chart...
See here: https://epoch.ai/data-insights/llm-inference-price-trends
LLMs are so comically inefficient compared to the human brain that it is pretty easy to imagine this trend continuing for several more 90% drops.
If LeCun's JEPA or GRAM turn out to be a thing, we could see a 3-4 order of magnitude drop in a single release cycle / generation.
Keep in mind that performance per watt on the hardware side - at the same time - is still doubling every ~24 months - and this doesn't factor that in.
No comments yet.