undefined | Better HN

0 pointsandruby8mo ago0 comments

Unlimited works well for everything that is “too cheap to meter”.

Internet, text messages, etc are roughly that: the direct costs are so cheap.

That’s not the case with LLM’s at this moment. There are significant direct costs to each long-running agent.

0 comments

Internet and SMS used to be expensive and metered until they weren't thanks to technological advances and expanded use. I think LLMs will follow the same path, maybe on a shorter timespan.

cmsjustin8mo ago

They were not expensive to operate, they were only expensive for consumers

tialaramex8mo ago

Right, that's crucial to understand. In 1985 you could make a direct dial from England to the US but it was eye wateringly expensive. £2 per minute. An hour's call to your mum? That's over £100.

But the cost to Bell and British Telecom was not £2 per minute, or £1 per minute, or even 1p per minute, it was nothing at all. Their costs were not for the call, but for the infrastructure over which the call was delivered, a transatlantic cable. If there was one call for ten minutes, once a week essentially at random, that cable must still exist, but if there are 10 thousand call minutes per week, a thousand times more, it's the same cable.

So the big telcos all just picked a number and understood it as basically free income. If everybody agrees this call costs £2 then it costs £2 right, and those 10 thousand call minutes generate a Million pound annual income.

It's maybe easier for Americans to understand if you tell them that outside the US the local telephone calls cost money back then. Why were your calls free? Because why not, the decision to charge for the calls is arbitrary, the calls don't actually cost anything, but you will need to charge somehow to recoup the maintenance costs. In the US the long distance calls were more expensive to make up for this for a time, today it's all absorbed in a monthly access fee on most plans.

2 more replies

KaiserPro8mo ago

To lay the cables required a huge amount of capital, to make that feasible its required financial engineering. That translates to high operating expenses.

1 more reply

hkt8mo ago

Competition is the thing. Prices will drop as more AI code assistants get more adoption.

Prices will probably also drop if anyone ever works out how to feasibly compete with NVIDIA. Not an expert here, but I expect they're worried about competition regulators, who will be watching them very closely.

2 more replies

alwillis8mo ago

Yes and no.

It’s very expensive to create these models and serve them at scale.

Eventually the processing power required to create them will come down, but that’s going to be a while.

Even if there was a breakthrough GPU technology announced tomorrow, it would take several years before it could be put into production.

And pretty much only TSMC can produce cutting edge chips at scale and they have their hands full.

Between Anthropic, xAI and OpenAI, these companies have raised about $84 billion dollars in venture capital… VCs are going to want a return on their investment.

So it’s going to be a while…

margalabargala8mo ago

SMS was designed from the start to fit in the handul of unused bytes in the tower handshake that was happening anyway, hence the 160 char limit. Its marginal cost has always been free on the supply side.

RF_Savage8mo ago

SMS routing and billing systems did cost money. Especially billing, as the standards had nothing for it, so it was done by 3rd party software for a very long time.

1 more reply

xtracto8mo ago

I think LLMs follow more of an Energy analogy: Gas or Electricity, or even water.

How much has any if these decreased over the last 5 decades? The problem is that as of right now, LLM cost is linearly (if not exponentially) related to the output. It's basically "transferring energy" converted into bytes. So unless we see some breakthrough in energy generation, or better use it, it will be difficult to scale.

This makes me wonder, would it be possible to pre-compue some kind of "rainbow tables" equivalent for LLMs? Either stored in the client or in the server; so as to reduce the computing needed for inference.

valenterry8mo ago

I don't think so. Yes, LLMs use electricity. But they use electricity in the data-center, not in your home. That's very different, because it's cheap to transfer tokens from the data-center to your home, but it's not cheap to transfer electricity from the data-center to your home. And that matters, because we can build a data-center in a place where there's lots of renewable and hence cheap energy (e.g. from solar or from water/wind).

If you think about it, LLMs are used mostly when people are awake, at least right now. And when is the sun shining? Right. So, build a data-center somewhere where land is cheap and lots of solar panels can be build right next to it. Sure, some other energy source will be used for stability etc., but it won't be as expensive as the energy price for your home.

> This makes me wonder, would it be possible to pre-compue some kind of "rainbow tables" equivalent for LLMs?

Already happening. Read up on how those companies do caching prompt-prefixes etc.

beefnugs8mo ago

Isn't it the exact opposite? No one is making profit yet, it is a mad dash to monopolize the market, it has to get more expensive to ever turn profit, so the screws will turn

jlaternman7mo ago

Yes! I agree completely. They’ve not even turned on the money faucets yet. These prices are likely just to hook users on the product, and will be more comparable to paying something that compares, but favourably, to minimum wage per hour in the future. Not implying a nefarious scheme, I just think that’s how the economics of it will pan out.

whimsicalism8mo ago

maybe, but they are not nearly as comparable as you’re making it out to be

MuffinFlavored8mo ago

> That’s not the case with LLM’s at this moment.

I'd be curious to know how many tokens the average $200/mo user uses and what the cost on their end for it is.

j / k navigate · click thread line to collapse

0 comments

rmujica8mo ago

Internet and SMS used to be expensive and metered until they weren't thanks to technological advances and expanded use. I think LLMs will follow the same path, maybe on a shorter timespan.

cmsjustin8mo ago

They were not expensive to operate, they were only expensive for consumers

tialaramex8mo ago

Right, that's crucial to understand. In 1985 you could make a direct dial from England to the US but it was eye wateringly expensive. £2 per minute. An hour's call to your mum? That's over £100.

2 more replies

KaiserPro8mo ago

To lay the cables required a huge amount of capital, to make that feasible its required financial engineering. That translates to high operating expenses.

1 more reply

hkt8mo ago

Competition is the thing. Prices will drop as more AI code assistants get more adoption.

2 more replies

alwillis8mo ago

Yes and no.

It’s very expensive to create these models and serve them at scale.

Eventually the processing power required to create them will come down, but that’s going to be a while.

Even if there was a breakthrough GPU technology announced tomorrow, it would take several years before it could be put into production.

And pretty much only TSMC can produce cutting edge chips at scale and they have their hands full.

Between Anthropic, xAI and OpenAI, these companies have raised about $84 billion dollars in venture capital… VCs are going to want a return on their investment.

So it’s going to be a while…

margalabargala8mo ago

RF_Savage8mo ago

SMS routing and billing systems did cost money. Especially billing, as the standards had nothing for it, so it was done by 3rd party software for a very long time.

1 more reply

xtracto8mo ago

I think LLMs follow more of an Energy analogy: Gas or Electricity, or even water.

valenterry8mo ago

> This makes me wonder, would it be possible to pre-compue some kind of "rainbow tables" equivalent for LLMs?

Already happening. Read up on how those companies do caching prompt-prefixes etc.

beefnugs8mo ago

Isn't it the exact opposite? No one is making profit yet, it is a mad dash to monopolize the market, it has to get more expensive to ever turn profit, so the screws will turn

jlaternman7mo ago

whimsicalism8mo ago

maybe, but they are not nearly as comparable as you’re making it out to be

MuffinFlavored8mo ago

> That’s not the case with LLM’s at this moment.

I'd be curious to know how many tokens the average $200/mo user uses and what the cost on their end for it is.

j / k navigate · click thread line to collapse