
0 points bangaladore 1y ago 0 comments

This is the debut of their thinking mode, IIRC.

Unfortunately, LLMs are shifting compute from train time to test time. I don't really like this, and frankly it shows that progress on architectures, datasets, etc. is stalling.


minihat 1y ago

Another take is that the base models are now good enough that spending more money for more intelligence is viable at test time. A threshold has been crossed.

bangaladore (OP) 1y ago

I guess I'd always thought the direct opposite.

Naively, I feel that to be useful, the goal of LLMs should be to become more power efficient, so that eventually all devices can be smarter.

Power efficiency can be gained through less test-time compute, more "intelligence," or some combination of the two. I'm not convinced these SOTA models are doing much more than increasing test-time compute.

holoduke 1y ago

The biggest impacts on power efficiency will come from advances in node size and transistor type, such as nanosheet or forksheet. Algorithms will help just a little.
