People want us to be at "Her" levels of AI, but we're at a far earlier stage. We can fake certain aspects of that (using TTS), but blindly trusting an AI to run everything is going to be a big mistake in the short term. And for the inevitability you describe to take place, its predecessors have to work in a way that doesn't scare people and businesses away.
The plowing of money and hype into the current forms of AI (not to mention the gaslighting about their abilities) makes me think the real inevitability is a meltdown in the next 5-10 years which leads to AI-hesitancy on a mass scale.
The problem with your "close to the metal" assertion is that the same claim has been parroted about every iteration of LLMs thus far. They've certainly gotten better (impressively so), but again, it doesn't matter. By their very nature (whether today or ten years from now), they're a big risk at the business level, which is ultimately where the rubber has to hit the road.
So obviously completely full of shit.
> If you can find a way to make the context window on the scale of the human brain, you may be able to mostly mitigate this.
Human brains have a much smaller context window than AIs do. We can't pay attention to the last 128,000 concepts that filtered past our sensory systems — our conscious working memory holds only about seven items at a time.
There's a lot of stuff that we don't yet understand well enough to reproduce with AI, but context length is the wrong criticism for these models.
You're right. What I'm getting at is the overall speed, efficiency, and accuracy with which the human brain stores, retrieves, and processes information.