I don't know whether somebody will create something new that does better. But there is no reason at all to extrapolate our current AIs into something that solves programming. Whatever constraints that new thing has will be completely unrelated to the current ones.
Perhaps you remember that language models were completely useless at coding some years ago, and now they can do quite a lot of things, even if they are not perfect. That is progress, and that does give reason to extrapolate.
Unless of course you mean something very special with "solving programming".
IMO, they're still useless today, with the only progress being that they can produce a more convincing facade of usefulness. I wouldn't call that very meaningful progress.
But for small personal projects? Yes, helpful.
LLMs can only give you code that somebody has written before. This is inherent. It is useful for a bunch of stuff, but that bunch won't change if OpenAI decides to spend the GDP of Germany training one instead of the GDP of Costa Rica.
This is trivial to prove to be false.
Invent a programming language that does not exist. Describe its semantics to an LLM. Ask it to write a program to solve a problem in that language. It will not always work, but it will work often enough to demonstrate that they are very much capable of writing code that has never been written before.
The first time I tried this was with GPT3.5, and I had it write code in an unholy combination of Ruby and INTERCAL, and it had no problems doing that.
Similarly, giving it the grammar of a hypothetical language and asking it to generate valid text in a language that has never existed before also works reasonably well.
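To make it concrete, here is a minimal sketch of the kind of hypothetical grammar this refers to; the grammar, its vocabulary, and the sampler are all invented for illustration, written in Python rather than prompted to an LLM:

    import random

    # A tiny context-free grammar for a language that has never existed,
    # of the kind you might paste into a prompt. (Invented for illustration.)
    grammar = {
        "STMT": [["let", "NAME", ":=", "EXPR", "!"]],
        "EXPR": [["NAME"], ["NUM"], ["(", "EXPR", "plus", "EXPR", ")"]],
        "NAME": [["zib"], ["quo"]],
        "NUM":  [["one"], ["two"]],
    }

    def expand(symbol):
        """Recursively expand a symbol using a randomly chosen production."""
        if symbol not in grammar:  # terminal symbol: emit as-is
            return symbol
        return " ".join(expand(s) for s in random.choice(grammar[symbol]))

    print(expand("STMT"))  # e.g. "let zib := ( quo plus two ) !"

Every string this prints is valid under the grammar, yet almost none of them appear in any corpus anywhere; validity comes from following rules, not from having seen the exact string before.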
This notion that LLMs only spit out things that have been written before might have been reasonable to believe a few years ago, but it hasn't been a reasonable position to hold for a long time now.
This premise is false. It is fundamentally equivalent to claiming that a language model trained on the dataset ["ABA", "ABB"] would be unable to generate, given the input "B", the string "BAB" or "BAA".
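To spell the toy example out, here is a minimal sketch (a bigram counter, nothing like a real LLM; the dataset and code are invented for illustration):

    from collections import defaultdict

    corpus = ["ABA", "ABB"]

    # Count character-to-character transitions in the training data.
    counts = defaultdict(lambda: defaultdict(int))
    for word in corpus:
        for a, b in zip(word, word[1:]):
            counts[a][b] += 1

    def next_chars(prefix):
        """Probability of each character following the end of `prefix`."""
        last = prefix[-1]
        total = sum(counts[last].values())
        return {c: n / total for c, n in counts[last].items()}

    # Learned transitions: A->B (twice), B->A (once), B->B (once).
    print(next_chars("B"))  # {'A': 0.5, 'B': 0.5}
    # Given the input "B", the chain B->A->B yields "BAB", a string
    # that appears nowhere in the training set.

Even this trivial model generalizes from transitions rather than memorizing whole strings, which is exactly why "it can only output what it was trained on" fails as a premise.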
And secondly, what you say is false (at least if taken literally). I can create a new programming language, give its definition in the prompt, ask the model to code something in my language, and expect something out. It might even work.
Set rules on what's valid, which most languages already do; omit generation of known code; generate everything else.
The computer does the work; programmers don't have to think it up.
A typed-language example to explain: generate valid func sigs.
func f(a int, b int) int {}
If that's the only func sig in our starting set, then what counts as novel becomes obvious.
Relative to our tiny starter set, func f(a int, b int, c int) int {} is novel; see the sketch below.
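A minimal sketch of that enumeration idea (Python emitting Go-style signature strings; the starter set and the naming scheme are invented for illustration):

    from itertools import count

    known = {"func f(a int, b int) int {}"}  # our tiny starter set

    def signatures():
        """Yield valid signatures with 1, 2, 3, ... int parameters."""
        for n in count(1):
            params = ", ".join(f"{chr(97 + i)} int" for i in range(n))
            yield f"func f({params}) int {{}}"

    # Everything valid minus everything known is novel by construction.
    novel = (sig for sig in signatures() if sig not in known)
    print(next(novel))  # func f(a int) int {}
    print(next(novel))  # func f(a int, b int, c int) int {}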
This Redis post is about fixing a prior decision made by a random programmer. A linguistic decision.
That's why LLMs seem worse than programmers: we make linguistic decisions that fit social idioms.
If we just want to generate all the code this model has never seen before, we don't need a programmer. If we need to abide by the laws of a flexible, natural language, that's what a programmer is for: composing code that does more than merely comply with ground truth.
That antirez is good at Redis is a bias, since he has context unseen by the LLM. I'm curious how well antirez would do with an entirely machine-generated Redis clone that was merely guided by experts. Would his intuition for Redis' implementation be useful on a completely unknown implementation?
He’d make a lot of newb errors and need mentorship, I’m guessing.
Programming has become vastly more efficient in terms of programmer effort over the decades, but making some aspects of the job more efficient just means all your effort is spent on what didn't improve.
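A toy Amdahl-style calculation makes that concrete (the 30% share and the 10x speedup are invented numbers, not measurements):

    # Suppose typing code is 30% of the job and becomes 10x faster.
    coding_share = 0.30   # assumed fraction of total effort
    speedup = 10          # assumed improvement of that fraction

    new_total = (1 - coding_share) + coding_share / speedup
    print(f"total effort: {new_total:.2f}x of before")  # 0.73x

The whole job shrinks by barely a quarter; the 70% that didn't improve now dominates your time.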
No, I don't remember that. They are doing similar things now that they did three years ago. They were still a decent rubber duck three years ago.