I think going from the LSAT to general thinking is still a very, very big leap. Passing exams is a fascinating benchmark, but by their nature these exams are limited in scope, have very clear assessment criteria, and come with a lot of associated, easily categorized data (like practice tests). General thought (particularly, say, coming up with an original idea) is a whole different ball game.
I don't say any of this to denigrate GPT-4; it looks amazing. But I'm reminded of the early days of self-driving vehicles: with 10% mastered, everyone assumed it was a race to 100% and that we'd all be in self-driving cars by now. The reality has been a lot more complicated than that.
That's a reasonable goal, but it's also not what people were aiming for historically. It's also very expansive: if human-level intelligence means outperforming, in every field, every human who ever lived, that's a high bar to meet. Indeed, it means that no human has ever achieved human-level intelligence.
That goalpost makes no sense: AIs are not human. They are fundamentally different, and therefore will always have a different set of strengths and weaknesses. Even long after vastly exceeding human intelligence everywhere it counts, an AI will still perform worse than us on some tasks. Importantly, an AI wouldn't have to meet your goalpost to be a major threat to humanity, or to render virtually all human labor worthless.
Think about how anthropomorphic this goalpost is if you apply it to other species: "Humans aren't generally intelligent, because their brains don't process scents as effectively as dogs', and they still struggle to locate scents spatially."
> (...)
> That is the goalpost for AGI. It’s an artificial human - a human replacement.
This considerably moves the goalpost. An AGI can have a different kind of intelligence than humans. If an AGI is as intelligent as a cat, it's still AGI.
More likely, the first AGI we develop will greatly exceed humans in some areas but have gaps in others. It won't completely replace humans, just as cats don't completely replace humans.
'Everything a human can do' is not the same as 'anything any human can do, as well as the best humans at that thing (because those are the ones we pay)'. Most humans cannot do any of the things you say you are waiting for an AI to do before you'll call it 'general'.
So the first part of your statement is the original goalpost, and the second part implies a very different one. The new goalpost you propose would imply that most humans are not generally intelligent, which you could argue... but it would definitely be a new goalpost.
The "GI" in AGI stands for general intelligence. If what you describe is your benchmark for general intelligence, then humans who cannot perform all these tasks to a hirable standard are not generally intelligent.
What you're asking for would already be bordering on ASI, artificial superintelligence.
By that definition do humans possess general intelligence?
Can you do everything a human can do? Can one human be a replacement for another?
I don't think the question makes sense without context. Which human? Which task?
I disagree with the premise. A single human isn't likely to be able to perform all these functions, so why demand that GPT-4 encompass all of them? It is already outperforming most humans on standardized tests that rely only on vision and text, and a human needs to be trained for those tasks.
It's already a human replacement. OpenAI has already described GPT-4 as coming "with great impact on functions like support, sales, content moderation, and programming."
This could mean something which is below a monkey’s ability to relate to the world and yet more useful than a monkey.
No, AGI would not need you to start a startup. It would start it itself.
It's a clear analogy.
This should become an article explaining what AGI really means.
I think the question "Can this AGI be my start-up co-founder, or my employee #1?", or something like that, is a great metric for when we've reached the AGI finish line.
There are many things that pattern matching over large amounts of data can solve; eventually we can probably get fully generated movies, music compositions, and novels. The problem is that all of the content of those works will have to have been formalized into rules before it is produced, since computers can only work with formalized data. None of those productions will ever contain an original thought, and I think that's why GPT-3's fiction feels so shallow.
So it boils down to a philosophical question: can human thought be formalized and written as rules? If it can, no human has ever had an original thought either, and it's a moot point.
Do you have evidence that human brains are not just super sophisticated pattern matching engines?
Humans read novels, listen to compositions, watch movies, and make new ones similar in some ways and different in other ways. What is fundamentally different about the process used for LLMs? Not the current generation necessarily, but what's likely to emerge as they continue to improve.
If so, it means the union of all human expertise is a few gigabytes. Having seen both a) what we can do in a kilobyte of code and b) a broad range of human behavior, this doesn't seem impossible. The more interesting question is: what are humans going to do with this remarkable object, a svelte pocket brain, not quite alive, a capable coder in ALL languages, a shared human artifact that can ace all tests? "May you live in interesting times," indeed.
Clearly the key takeaway from GPT is that, given enough unstructured data, LLMs can produce impressive results.
From my point of view, the flaw in most discussion surrounding AI is not that people underestimate computers but that they overestimate how special humans are. At the end of the day, every thought is a bunch of chemical potentials changing in a small blob of flesh.
It is probably true that at any given point many, many people have had the same or very similar ideas.
Those who execute or are in the right place and time to declare themselves the originator are the ones we think innovated.
It isn't true, or rarely is. History is written by the victors (and their simps).
No, and I think it's because human thought is based on continuous inferencing of experience, which gives rise to the current emotional state and the feeling of it. For a machine to do this, it will need a body and the ability to direct attention, at will, to the things it is inferencing.
To be honest, perhaps the language model works better without the evolutionary baggage.
That isn't to discount the other things we can do with our neural nets. For instance, it is possible to think without language (see music, instantaneous mental arithmetic, intuition), but these are essentially independent specialised models that run on the same hardware and that our language model can interrogate. We train these models from birth.
Whether intentional or not, AI research is very much going in the direction of replicating the human mind.
I have a sneaking suspicion that all that will be required for bypassing the upcoming road blocks is giving these machines:
1) existential needs that must be fulfilled
2) active feedback loops with their environments (continuous training)
We always thought that if AI can do X then it can do Y and Z. It keeps turning out that you can actually get really good at doing X without being able to do Y and Z, so it looks like we're moving the goalposts, when we're really just realizing that X wasn't as informative as we expected. The issue is that we can't concretely define Y and Z, so we keep pointing at the wrong X.
But all indication is that we're getting closer.
> “there are/are not, additional properties to human level symbol manipulation, beyond what GPT encapsulates.”
GPT does appear to do an awful lot of pattern extrapolation before we find its limits.
The notion of some sort of technological "singularity" is just silly. It is essentially an article of faith, a secular religion among certain pseudo-intellectual members of the chattering class. There is no hard scientific backing for it.
What, in your mind, should the goal posts be for AGI?
Currently, you could prompt GPT to act as if it is sentient and has qualia, and it will do quite a good job at trying to convince you it's not a P-Zombie.
I know I’m not the first to say this, but this is also a generalization of many jobs performed right now.
Follow the template, click the boxes, enter the text/data in the standard format, submit before 4pm. Come in tomorrow and do it again.
If that automation doesn't require oversight, everyone wins, since that process (say, typing data from a ledger) is now free to anyone who wants to use it. The exception, of course, is if a monopoly or oligopoly controls the process, so it's up to the government to break them up and keep the underlying tech accessible.
The biggest risk is how much computing power it takes to run these models, so it’s very important to support the open alternatives that are trying to lower the barrier to entry.
Exactly, much like a chess bot can play perfectly without what humans would call thinking.
I think (ironically) we'll soon realize that there is no actual task that would require thinking as we know it.
If that were true, there would be no point in studying or doing any LSAT preparation. Writing practice exams would be of no benefit.
As others have said elsewhere, the issue remains accuracy. I wish every response came with an honest estimate of how likely the answer is to be correct, because at the moment it gives wrong answers as confidently as right ones.
I can remember my GRE coach telling me that it was better to confidently choose an answer I only had 50% confidence in, rather than punt on the entire question.
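The coach's advice is just expected value. A back-of-the-envelope sketch, assuming a hypothetical negative-marking scheme (+1 per correct answer, -1/4 per wrong answer, 0 for a skipped question); the actual scoring rules varied by test and era, so treat the numbers as illustrative:

```python
# Expected score for answering with confidence p vs. skipping (score 0),
# under the assumed +1 / -0.25 / 0 scoring scheme.
def expected_score(p, reward=1.0, penalty=0.25):
    return p * reward - (1 - p) * penalty

print(expected_score(0.5))   # 50% confidence: 0.375, better than skipping
print(expected_score(0.2))   # blind guess among 5 choices: break-even at 0.0
```

Under that scheme, any confidence above pure chance makes answering strictly better than skipping, which is presumably the coach's point.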
AIs hallucinate because, statistically, it is 'rewarding' for them to do so. (In RLHF)
Obviously not, since GPT-4 doesn't have general intelligence. Likewise, it lacks "common sense," "knowledge about the world," and "reasoning ability."
As just one example, reasoning ability: GPT-4 failed at this problem I just came up with: "If Sarah was twice as old as Jimmy when Jimmy was 1/3 as old as Jane, and Jane is as much older than Sarah as Sarah is older than Jimmy, and Sarah is now 40, how old are Jane and Jimmy?"
First, every answer GPT-4 came up with contradicted the facts given: they were just wrong. But beyond that, it didn't recognize that there are many solutions to the problem. And later when I gave it an additional constraint to narrow it to one solution, it got the wrong answer again. And when I say "wrong," I mean that its answer clearly contradicted the facts given.
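The underdetermination is easy to verify by brute force. A minimal sketch, assuming integer current ages and that "when Jimmy was 1/3 as old as Jane" happened a whole number of years ago (variable names are mine, not from the puzzle):

```python
# Encode the puzzle's three constraints and enumerate candidates.
# Sarah is 40 now; jane - 40 == 40 - jimmy pins Jane to 80 - jimmy.
solutions = []
for jimmy in range(1, 80):          # Jimmy's current age
    jane = 80 - jimmy               # "Jane is as much older than Sarah..."
    for t in range(0, 40):          # years ago (Sarah was 40 - t then)
        was_third = 3 * (jimmy - t) == jane - t   # Jimmy was 1/3 Jane's age
        was_twice = 40 - t == 2 * (jimmy - t)     # Sarah was twice Jimmy's age
        if was_third and was_twice:
            solutions.append((jimmy, jane, t))

print(len(solutions))   # every jimmy from 20 to 39 works, so many valid answers
```

The constraints collapse to a single linear relation (Jane = 80 - Jimmy), which is exactly why the problem has a whole family of solutions rather than one.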
Driving as well as an attentive human in real time, in all conditions, probably requires AGI as well.
GPT-4 is not an AGI, and GPT-5 might not be either. But the barriers are getting thinner and thinner. Are we really ready for AGI in a plausibly-within-our-lifetime future?
Sam Altman wrote that AGI is a top potential explanation for the Fermi Paradox. If that were remotely true, we should be doing 10x-100x the work on AI alignment research.
Now, granted, plenty of humans don't score above a 2 on those exams either. But I think it's indicative that there's still plenty of progress left to make before this technology is indistinguishable from magic.
Sure but look in this thread, there are already plenty of people citing the use of GPT in legal or medical fields. The danger is absolutely real if we march unthinkingly towards an AI-driven future.
Not yet it won't. It doesn't take much imagination to foresee where this kind of AI is used to inform legal or medical decisions.
And medicine is nothing but pattern matching. Symptoms -> diagnosis -> treatment.
Driving assistance and the progress made there and large language models and the progress made there are absolutely incomparable.
The general public's hype around driving assistance is fueled mostly by the hype surrounding one car maker and its figurehead, a hype that has built up over a few years and become accepted by the public, reflected in that car maker's stock price.
Large language models have not yet permeated the public's memory. And the point is that inside language you can find our human culture, and inside a large language model you have essentially the English language with its embeddings. It is real, it is big, it is powerful, it is respectable research.
There's nothing in driving assistance that can be compared to LLMs. Driving-assistance systems don't have an embedding of the entire physical surface of planet Earth or an understanding of driving physics. They're nothing by comparison.