undefined | Better HN

0 pointsTenobrus4d ago0 comments

what basis do you have for assuming an LLM is fundamentally incapable of doing this?

0 comments

What's your basis for assuming LLM is capable of doing this?

I honestly don't know personally either way. Based on my limited understanding of how LLMs work, I don't see them be making the next great song or next great book and based on that reasoning I'm betting that it probably wont be able to do whatever next "Descartes, Newton, Leibnitz, Gauss, Euler, Ramanujan, Galois" are going to do.

Of course AI as a wider field comes up with something more powerful than LLM that would be different.

EMM_3864d ago

"I don't see them be making the next great song"

Meanwhile, songs are hitting number one on some charts on Spotify that people think are humans and are actually AI. And Spotify has to start labelling them as such. One AI "band" had an entire album of hits.

Also - music is a subjective. Mathematics isn't.

And in this case, an LLM discovered a new way to reason about a conjecture. I don't know how much proof is needed - since that is literally proof that it can be done.

truncate4d ago

>> Meanwhile, songs are hitting number one on some charts on Spotify that people think are humans and are actually AI. And Spotify has to start labelling them as such. One AI "band" had an entire album of hits.

There is quite some questions around that. Music is subjective and obviously different people have different taste, but I wouldn't call any of them to be actual good music / real hits.

>> LLM discovered a new way to reason about a conjecture

I wasn't questioning LLMs ability to prove things. Parent threads were talking about building new kind of maths , or approaching it in a creative/artistic way. Thats' what I was referring to.

I can't speak for maths of hard science as I'm not trained in that, but the creativity aspect in code is definitely lacking when it comes to LLMs. May not matter down the line.

dist-epoch4d ago

LLMs are already making the next great songs. Just check out the Billboard charts.

truncate4d ago

I'm sorry, I don't consider them "great songs". Obviously, different people have different taste.

blueone4d ago

> what basis do you have for assuming an LLM is fundamentally incapable of doing this?

because I have no basis for assuming an LLM is fundamentally capable of doing this.

sswatson4d ago

Good on you for spelling out this reasoning, but it is manifestly unsound. For a wide variety of values of X, people a few years ago had no reason to expect that LLMs would be capable of X. Yet here we are.

TheOtherHobbes4d ago

In 1989, Gary Kasparov said that it was "ridiculous!" to suggest a computer would ever beat him at chess.

"Never shall I be beaten by a machine!”

In 1997 he lost to Deep Blue.

FartyMcFarter4d ago

Yeah, and back then people moved the goal posts too, saying Deep Blue was just "brute-forcing" chess (which isn't even true since it's not a pure minimax search).

1 more reply

Applejinx4d ago

And today he's got salient observations on politics which hold much of his attention, and Deep Blue is shut off and has done nothing further.

Not a good argument for turning everything over to the Deep Blues. What's Deep Blue done for me lately?

zardo4d ago

This is something that could be demonstrated rather than just argued.

Train an LLM only on texts dated prior to Newton and see if it can create calculus, derrive the equations of motion, etc.

If you ask it about the nature of light and it directs you to do experiments with a prism I'd say we're really getting somewhere.

gjm114d ago

We tried this experiment with humans, back in the 17th century, and only a few[1] out of millions managed it given a whole human lifetime each.

[1] Obviously Newton counts as one. Leibniz like Newton figured out calculus. Other people did important work in dynamics though no one else's was as impressive as Newton's. But the vast majority of human-level intelligences trained on texts prior to Newton did not create calculus or derive the equations of motion or come close to doing either of those things.

1 more reply

pickleRick2434d ago

Except this has been said since the 2010's and has been proven wrong again and again. Clearly the theory that LLM's can't "extrapolate" is woefully incomplete at best (and most likely simply incorrect). Before the rise of ChatGPT, the onus was on the labs to show it was plausible. At this point, I think the more epistemologically honest position is to put the burden back on the naysayers. At the least, they need to admit they were wrong and give a satisfactory explanation why their conceptual model was unable to account for the tremendous success of LLM's and why their model is still correct going forward. Realistically, progress on the "anti-LLM" side requires a more nuanced conceptual model to be developed carefully outlining and demonstrating the fundamental deficiencies of LLMs (not just deficiencies in current LLMs, but a theory of why further advancements can't solve the deficiencies).

Incidentally, similar conversations were had about ML writ large vs. classical statistics/methods, and now they've more or less completely died down since it's clear who won (I'm not saying classical methods are useless, but rather that it's obvious the naysayers were wrong). I anticipate the same trajectory here. The main difference is that because of the nature of the domain, everyone has an opinion on LLM's while the ML vs. statistics battle was mostly confined within technical/academic spaces.

davebren4d ago

> Clearly the theory that LLM's can't "extrapolate" is woefully incomplete at best (and most likely simply incorrect).

What example is there where an LLM has extrapolated? All I've seen is a data set so large and an extra decomposition process making it so interpolation feels like extrapolation if you don't look close enough.

> but a theory of why further advancements can't solve the deficiencies

How about LeCun's?

dvt4d ago

Because by definition LLMs are permutation machines, not creativity machines. (My premise, which you may disagree with, is that creativity/imagination/artistry is not merely permutation.)

fnordpiglet4d ago

I prefer to think of it as they’re interpolation machines not extrapolation machines. They can project within the space they’re trained in, and what they produce may not be in their training corpus, but it must be implied by it. I don’t know if this is sufficient to make them too weak to create original “ideas” of this sort, but I think it is sufficient to make them incapable of original thought vs a very complex to evaluate expected thought.

drdeca4d ago

People keep saying this, but if you try to interpret this at all literally, it just doesn’t work. Like, it’s phrased like it should have a precise meaning, right? Like, people even mention convex hulls when talking about it.

But if you actually try to take a convex hull of, some encoding of sentences as vectors? It isn’t true. The outputs are not in the convex hull of the training data.

I guess it’s supposed to be a metaphor and not literal, but in that case it’s confusing. Especially seeing as there are contexts in machine learning where literal interpolation vs literal extrapolation, is relevant. So, please, find a better way to say it than saying that “it can only interpolate”?

lukol4d ago

This "new math" might be a recombination of things that we already know - or an obvious pattern that emerges if you take a look at things from a far enough distance - or something that can be brute-forced into existence. All things LLMs are perfectly capable of.

In the end, creativity has always been a combination of chance and the application of known patterns in new contexts.

dvt4d ago

> This "new math" might be a recombination of things that we already know

If you know anything about the invention of new math (analytic geometry, Calculus, etc.), you'd know how untrue this is. In fact, Calculus was extremely hand-wavy and without rigorous underpinnings until the mid 1800s. Again: more art than science.

jfyi4d ago

Newton and Leibniz were "hand-waving"?

If anything, they were fighting an uphill battle against the perception of hand-waving by their contemporaries.

2 more replies

baq4d ago

And yet nowadays you can restate all of it using just combinations of sets of sets and some logic operators.

nh23423fefe4d ago

god of the gaps

iwontberude4d ago

non overlapping magisteria

satvikpendem4d ago

What is creativity if not permutation? A brain has some model of the world and recombines concepts to create new concepts.

1 more reply

KoolKat234d ago

It pretty much is, otherwise it is randomness or entropy.

lajamerr4d ago

LLMs by themselves are not able to but you are missing a piece here.

LLMs are prompted by humans and the right query may make it think/behave in a way to create a novel solution.

Then there's a third factor now with Agentic AI system loops with LLMs. Where it can research, try, experiment in its own loop that's tied to the real world for feedback.

Agentic + LLM + Initial Human Prompter by definition can have it experiment outside of its domain of expertise.

So that's extending the "LLM can't create novel ideas" but I don't think anyone can disagree the three elements above are enough ingredients for an AI to come up with novel ideas.

awesome_dude4d ago

You're proving the GP's argument - LLMs aren't creative you say as much, it's the driving that is the creative force

lajamerr4d ago

You can tell an agentic system. "Go and find a novel area of math that has unresolved answers and solve it mathematically with verified properties in LEAN. Verify before you start working on a problem that no one has solved this area of math"

That's not creative prompt. That's a driving prompt to get it to start its engine.

You could do that nowadays and while it may spend $1,000 to $100,000 worth of tokens. It will create something humans haven't done before as long as you set it up with all its tool calls/permissions.

1 more reply

charlie904d ago

I believe when we have AI Agents "living" 24/7, they will become creative machines. They will test ideas out their own ideas experimentally, come across things accidentally, synthesize new ideas.

We just haven't let AI run wild yet. But its coming.

1 more reply

Barbing4d ago

If that’s a requirement, aren’t LLMs driven by pretraining which was human driven?

Who decides at which the last point it’s OK to provide text to the model in order to be able to describe it as creative? (non-rhetorical)

j / k navigate · click thread line to collapse

0 comments

truncate4d ago

What's your basis for assuming LLM is capable of doing this?

Of course AI as a wider field comes up with something more powerful than LLM that would be different.

EMM_3864d ago

"I don't see them be making the next great song"

Also - music is a subjective. Mathematics isn't.

And in this case, an LLM discovered a new way to reason about a conjecture. I don't know how much proof is needed - since that is literally proof that it can be done.

truncate4d ago

There is quite some questions around that. Music is subjective and obviously different people have different taste, but I wouldn't call any of them to be actual good music / real hits.

>> LLM discovered a new way to reason about a conjecture

I wasn't questioning LLMs ability to prove things. Parent threads were talking about building new kind of maths , or approaching it in a creative/artistic way. Thats' what I was referring to.

I can't speak for maths of hard science as I'm not trained in that, but the creativity aspect in code is definitely lacking when it comes to LLMs. May not matter down the line.

dist-epoch4d ago

LLMs are already making the next great songs. Just check out the Billboard charts.

truncate4d ago

I'm sorry, I don't consider them "great songs". Obviously, different people have different taste.

blueone4d ago

> what basis do you have for assuming an LLM is fundamentally incapable of doing this?

because I have no basis for assuming an LLM is fundamentally capable of doing this.

sswatson4d ago

TheOtherHobbes4d ago

In 1989, Gary Kasparov said that it was "ridiculous!" to suggest a computer would ever beat him at chess.

"Never shall I be beaten by a machine!”

In 1997 he lost to Deep Blue.

FartyMcFarter4d ago

Yeah, and back then people moved the goal posts too, saying Deep Blue was just "brute-forcing" chess (which isn't even true since it's not a pure minimax search).

1 more reply

Applejinx4d ago

And today he's got salient observations on politics which hold much of his attention, and Deep Blue is shut off and has done nothing further.

Not a good argument for turning everything over to the Deep Blues. What's Deep Blue done for me lately?

zardo4d ago

This is something that could be demonstrated rather than just argued.

Train an LLM only on texts dated prior to Newton and see if it can create calculus, derrive the equations of motion, etc.

If you ask it about the nature of light and it directs you to do experiments with a prism I'd say we're really getting somewhere.

gjm114d ago

We tried this experiment with humans, back in the 17th century, and only a few[1] out of millions managed it given a whole human lifetime each.

1 more reply

pickleRick2434d ago

davebren4d ago

> Clearly the theory that LLM's can't "extrapolate" is woefully incomplete at best (and most likely simply incorrect).

> but a theory of why further advancements can't solve the deficiencies

How about LeCun's?

dvt4d ago

Because by definition LLMs are permutation machines, not creativity machines. (My premise, which you may disagree with, is that creativity/imagination/artistry is not merely permutation.)

fnordpiglet4d ago

drdeca4d ago

But if you actually try to take a convex hull of, some encoding of sentences as vectors? It isn’t true. The outputs are not in the convex hull of the training data.

lukol4d ago

In the end, creativity has always been a combination of chance and the application of known patterns in new contexts.

dvt4d ago

> This "new math" might be a recombination of things that we already know

jfyi4d ago

Newton and Leibniz were "hand-waving"?

If anything, they were fighting an uphill battle against the perception of hand-waving by their contemporaries.

2 more replies

baq4d ago

And yet nowadays you can restate all of it using just combinations of sets of sets and some logic operators.

nh23423fefe4d ago

god of the gaps

iwontberude4d ago

non overlapping magisteria

satvikpendem4d ago

What is creativity if not permutation? A brain has some model of the world and recombines concepts to create new concepts.

1 more reply

KoolKat234d ago

It pretty much is, otherwise it is randomness or entropy.

lajamerr4d ago

LLMs by themselves are not able to but you are missing a piece here.

LLMs are prompted by humans and the right query may make it think/behave in a way to create a novel solution.

Then there's a third factor now with Agentic AI system loops with LLMs. Where it can research, try, experiment in its own loop that's tied to the real world for feedback.

Agentic + LLM + Initial Human Prompter by definition can have it experiment outside of its domain of expertise.

So that's extending the "LLM can't create novel ideas" but I don't think anyone can disagree the three elements above are enough ingredients for an AI to come up with novel ideas.

awesome_dude4d ago

You're proving the GP's argument - LLMs aren't creative you say as much, it's the driving that is the creative force

lajamerr4d ago

That's not creative prompt. That's a driving prompt to get it to start its engine.

You could do that nowadays and while it may spend $1,000 to $100,000 worth of tokens. It will create something humans haven't done before as long as you set it up with all its tool calls/permissions.

1 more reply

charlie904d ago

I believe when we have AI Agents "living" 24/7, they will become creative machines. They will test ideas out their own ideas experimentally, come across things accidentally, synthesize new ideas.

We just haven't let AI run wild yet. But its coming.

1 more reply

Barbing4d ago

If that’s a requirement, aren’t LLMs driven by pretraining which was human driven?

Who decides at which the last point it’s OK to provide text to the model in order to be able to describe it as creative? (non-rhetorical)

j / k navigate · click thread line to collapse