Ignore the critics. Watch the demos. Play with it.
This stuff feels magical. Magical. It makes the movie "Her" look like it's no longer in the realm of science fiction but in the realm of incremental product development. HAL's unemotional monotone in Kubrick's "2001: A Space Odyssey" feels... oddly primitive by comparison. I'm impressed at how well this works.
Well-deserved congratulations to everyone at OpenAI!
Because its capacities are focused on exactly the right place to feel magical. Which isn’t to say that there isn’t real utility, but language (written, and even more so spoken) has an enormous emotional resonance for humans, so this is laser-targeted in an area where every advance is going to “feel magical” whether or not it moves the needle much on practical utility; it’s not unlike the effect of TV news making you feel informed, even though time spent watching it negatively correlates with understanding of current events.
I worry about the 'cheery intern' response becoming something of a punch line.
"Hey siri, launch the nuclear missiles to end the world."
"That's a GREAT idea, I'll get right on that! Is there anything else I can help you with?"
Kind of punch lines.
Will be interesting to see where that goes once you've got a good handle on capturing the part of speech that isn't "words" so much as it is inflection and delivery. I am interested in a speech model that can differentiate between "I would hate to have something happen to this store." as a compliment coming from a customer and as a threat coming from an extortionist.
But enough of that. The future looks bright. Everyone smile!
Or else..
"Guys, I am just pleased as punch to inform you that there are two thermo-nuclear missiles headed this way... if you don't mind, I'm gonna go ahead and take evasive action."
These things are amazing compared to old-school NLP: the step-change in capability is real.
But we should also keep our wits about us: they are well described by current or conjectural mathematics, they fail at things dolphins can do, it’s not some AI god and it’s not self-improving.
Let’s have balance on both the magic of the experience and getting past the tech demo stage: every magic trick has a pledge, but I think we’re still working on the prestige.
this focus subverts its intended effect on those of us with hair trigger bullshit-PTSD
Another step closer for those 7 trillion that OpenAI is so desperate for.
Edit: Apparently not, based on your clarification; instead the researchers don't know any better than to march into a local maximum because they're only human and seek to replicate themselves. I assumed too much good faith.
(Arguably, it is the other way around: they aren’t focused on appealing to those biases, but driven by them, in that the perception of language modeling as a road to real general reasoning is a manifestation of the same bias which makes language capacity be perceived as magical.)
Sounds like the people who defend astrology because it feels magical how their horoscope fits their personality.
"Don't bother me with facts that destroy my rose-tinted view"
At the moment AI is massively hyped and shoved into everything. To point at the faults and weaknesses is a reasonable and responsible thing to do.
3 years ago, if you told me you could facetime with a robot, and they could describe the environment and have a "normal" conversation with me, I would be in disbelief, and assume that tech was a decade or two in the future. Even the stuff that was happening 2 years ago felt unrealistic.
astrology is giving vague predictions like "you will be happy today". GPT-4o is describing to you actual events in real time.
"Rather than ship a product, companies can ship blueprints and everyone can just print stuff at their own home! Everything will be 3d printed! It's so magical!"
Just because a tech is magical today doesn't mean that it will be meaningful tomorrow. Sure, 3d printing has its place (mostly in making plastic parts for things) but it's hardly the revolutionary change in consumer products that it was touted to be. Instead, it's just a hobbyist toy.
GPT-4o being able to describe actual events in real time is interesting; it's yet to be seen if that's useful.
That's mostly the thinking here. A lot of the "killer" AI tech has really boiled down to "Look, this can replace your customer support chat bot!". Everyone is rushing to try and figure out what we can use LLMs for (just like they did when ML was supposed to take over the world), and so far it's been niche applications to make shareholders happy.
So far the biggest use case for LLMs is mass propaganda and scams. The fact that we might also get AI girlfriends out of the tech understandably doesn't seem that appealing to a lot of folks.
The first users of Eliza felt the same about the conversation with it.
The important point is to know that GPTs don't know or understand.
It may feel like a normal conversation but it is a Chinese Room on steroids.
People started to ask GPTs questions and take the answers as facts because they believe it's intelligent.
Does it really or are you just playing facile word association games with the word "magical"?
AI has a great deal of substance. It can draft documents. It can identify foods in a picture and give me a recipe that uses them. It can create songs, images and video.
AI, of course, has a lot of flaws. It does some things poorly, it does other things with bias, and it's not suitable for a huge number of use cases. To imply that something with a great deal of substance, flaws and all, is the same as something that has no substance whatsoever (nor ever will) is just not a reasonable thing to do.
"AI is massive hype and shoved into everything" has more grounding as a negative feeling of people being overwhelmed with technology than any basis in fact. The faults and weaknesses are buoyed more by people trying to acknowledge your feelings than by any real criticism of a technology that is changing faster than the faults-and-weaknesses arguments can be made. Study machine learning and come back with an informed criticism.
Not having to write boilerplate code itself also is very handy.
So yes, I absolutely do want this "magic." "I don't like it so no one should use it" is a pretty narrow POV.
I’d strongly prefer that though, along with HAL’s reasoning abilities.
There wasn't any incentive to make it sound artificially emotional or empathetic beyond a "Sorry, Dave".
To use another pop-culture reference, Obi-Wan in Episode IV had deep empathy, but didn’t speak emotionally. Those are separate things.
A lot of terrible human behavior is driven by emotions. An emotionless machine will never dump you out the airlock in a fit of rage.
Have you seen the final scene of the movie Ex Machina? Without spoilers, I'll just say that acting like it has emotions is very different from actually having them. This is in fact what socio- and psychopaths are like, with stereotypical results.
With so many smoke-and-mirrors demos out there, I am not super excited about those videos. I would play with it, but it seems like it is not available in a free tier (I stopped paying OpenAI a while ago after realizing that open models are more than enough for me).
Don’t get me wrong, I'm excited about this update, but I’m struggling to see what is so magical about it. Then again, I’ve been using GPT voice every day for months, so if you’re just blown away by talking to a computer then I get it.
When GPT-2/3/3.5/4 came out, it was fairly easy to see the progression from reading model outputs that it was just getting better and better at text. Which was pretty amazing but in a very intellectual way, since reading is typically a very "intellectual" "front-brain" type of activity.
But this voice stuff really does make it much more emotional. I don't know about you, but the first time I used GPT's voice mode I noticed that I felt something -- very un-intellectually, very un-cerebrally -- like, the feeling that there is a spirit embodying the computer. Of course with LLMs there always is a spirit embodying the computer (or, there never is, depending on your philosophical beliefs).
The Suno demos that popped up recently should have clued us all in that this kind of emotional range was possible with these models. This announcement is not so much a step function in model capabilities, but it is a step function in HCI. People are just not used to their interactions with a computer being emotional like this. I'm excited and concerned in equal parts that many people won't be truly prepared for what is coming. It's on the horizon: having an AI companion that really truly makes you feel things.
Us nerds who habitually read text have had that since roughly GPT-3, but now the door has been blown open.
Very excited about faster response times, auto interrupt, cheaper api, and voice api — but the “emotional range” is actually disappointing to me. hopefully it doesn’t impact the default experience too much, or the memory features get good enough that I can stop it from trying to pretend to be a human
Tone, Emphasis, Speed, Accent are all very important parts of how humans communicate verbally.
Before today, voice mode was strictly audio→text, then text→audio. All that information was destroyed.
Now the same model takes in audio tokens and spits back out audio tokens directly.
Watch this demo, it's the best example of the kind of thing that would be flat out impossible with the previous setup.
https://www.youtube.com/live/DQacCB9tDaw?si=2LzQwlS8FHfot7Jy
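Roughly, the difference between the two setups can be sketched like this. Everything here is a made-up stand-in (none of these functions are real APIs or actual model internals), and "audio" is modeled as a (words, tone) pair purely to show where the non-verbal information gets lost:

```python
# Toy sketch: "audio" is a (words, tone) pair so the information loss
# in the cascaded pipeline is visible. All functions are hypothetical
# stand-ins, not real OpenAI internals.

def transcribe(audio):
    """ASR stand-in: keeps the words, throws away tone/emphasis/accent."""
    words, tone = audio
    return words

def text_llm(text):
    """Text-only model stand-in: never hears the speaker at all."""
    return f"reply to: {text}"

def synthesize(text):
    """TTS stand-in: has to invent a delivery, so it defaults to flat."""
    return (text, "neutral")

def old_pipeline(audio):
    """Pre-4o voice mode: audio -> text -> LLM -> text -> audio."""
    return synthesize(text_llm(transcribe(audio)))

def multimodal_model(audio_tokens):
    """End-to-end stand-in: one model sees words *and* delivery."""
    words, tone = audio_tokens
    return (f"reply to: {words}", tone)

def new_pipeline(audio):
    """GPT-4o style: audio tokens in, audio tokens out."""
    return multimodal_model(audio)

speech = ("I would hate for something to happen to this store", "menacing")
print(old_pipeline(speech))  # tone comes back as "neutral" -- it was lost
print(new_pipeline(speech))  # tone survives end to end
```

In the old setup the model literally never receives the tone, so no amount of prompting can recover it; in the new one it is part of the input and output token stream.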
I’m very excited about all these updates and it’s really cool tech, but all I’m seeing is quality of life improvements and some cool engineering.
That’s not necessarily a bad thing. Not everything has to be magic or revolutionary to be a cool update
on a tangent...
I find it interesting the psychology behind this. If the voice in 2001 had proper inflection, it wouldn't have been perceived as a computer.
(also, I remember when voice synthesizers got more sophisticated and Stephen Hawking decided to keep his original first-gen voice because he identified more with it)
I think we'll be going the other way soon. Perfect voices, with the perfect emotional inflection will be perceived as computers.
However I think at some point they may be anthropomorphized and given more credit than they deserve. This will probably be cleverly planned and a/b tested. And then that perfect voice, for you, will get you to give in.
2. Even then this is a wonderful step for tech in general and not just OpenAI. Makes me very excited.
3. Most economic value and growth driven by AI will not come from consumer apps but rather enterprise use. I am interested in seeing how AI can automatically buy stuff for me, automate my home, reduce my energy use, automatically apply for and get credit cards based on my purchases, find new jobs for me, negotiate with a car dealer on my behalf, detect when I am going to fall sick, better diabetes care and an eventual cure, etc. etc.
Are we supposed to cheer to that?
We're already midway to the full implementation of 1984; do we need Her before we get to Matrix?
Well that's exactly why I'm not looking forward to whatever is coming. The average joe thinking dating a server is not a dystopia frightens me much more than the delusional tech CEO who thinks his AI will revolutionise the world.
> Things might even improve substantially if we all interact with personalities that are consistently positive and biased towards conflict resolution and non judgemental interactions.
Some kind of turbo bubble in which you don't even have to actually interact with anyone or anything? Every "personality" will be nice to you as long as you send $200 to OpenAI every week. Yep, that's absolutely a dystopia for me.
It really feels like the end goal is living in a pod and being uploaded in an alternative reality, everything we build to "enhance" our lives take us further from the basic building blocks that make life "life".
The future is indeed here... and it is, indeed, not equitably distributed.
The simplest example is “list all of the presidents in reverse chronological order of their ages when inaugurated”.
Both ChatGPT 3.5 and 4 get the order wrong. The difference is that I can instruct ChatGPT 4 to “use Python”.
https://chat.openai.com/share/87e4d37c-ec5d-4cda-921c-b6a9c7...
You can do similar things to have it verify information by using internet sources and give you citations.
Just like with the Python example, at least I can look at the script/web citation myself
This question is probably not the simplest form of the query you intend to receive an answer for.
If you want a descending list of presidents based on their age at inauguration, I know what you want.
If you want a reverse chronological list of presidents, I know what you want.
When you combine/concatenate the two as you have above, I have no idea what you want, nor do I have any way of checking my work if I assume what you want. I know enough about word problems and how people ask questions to know that you probably have a fairly good idea what you want and likely don’t know how ambitious this question is as asked, and I think you and I both are approaching the question with reasonably good faith, so I think you’d understand or at least accommodate my request for clarification and refinement of the question so that it’s less ambiguous.
Can you think of a better way to ask the question?
Now that you’ve refined the question, do LLMs give you the answers you expect more frequently than before?
Do you think LLMs would be able to ask you for clarification in these terms? That capability to ask for clarification is probably going to be as important as other improvements to the LLM, for questions like these that have many possibly correct answers or different interpretations.
Does that make sense? What do you think?
I tried asking the question more clearly
I think it “understood” the question because it “knew” how to write the Python code to get the right answer. It parsed the question as expected
The previous link doesn’t show the Python. This one does.
https://chat.openai.com/share/a5e21a97-7206-4392-893c-55c531...
LLMs are generally not good at math. But in my experience ChatGPT is good at creating Python code to solve math problems
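For instance, the kind of script it tends to emit for the descending-by-age-at-inauguration interpretation looks something like this sketch (a hand-picked subset of presidents rather than the full list, with ages at inauguration):

```python
# Subset of presidents with their age at inauguration (first term).
presidents = {
    "Joe Biden": 78,
    "Donald Trump": 70,
    "Ronald Reagan": 69,
    "Barack Obama": 47,
    "Bill Clinton": 46,
    "John F. Kennedy": 43,
    "Theodore Roosevelt": 42,
}

# Sort descending by age at inauguration.
ranked = sorted(presidents.items(), key=lambda kv: kv[1], reverse=True)
for name, age in ranked:
    print(f"{name}: {age}")
```

The point isn't that the code is sophisticated (it's one `sorted` call); it's that the arithmetic and ordering are delegated to Python, where they can't go wrong, and the script is short enough to verify by eye.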
The last part of the movie "Her" is still in the realm of science fiction, if not outright fantasy. Reminds me of the later seasons of SG1 with all the talk of ascension and Ancients. Or Clarke's 3001 book intro, where the monolith creators figured out how to encode themselves into spacetime. There's nothing incremental about that.
In comparison to the gas pump which says "Thank You!"
If chatbots feel magical, what those people did will feel divinely inspired.
However, using ChatGPT with transcribing is already offering me a similar experience, so what is new exactly?
It’s not accessible to everyone yet.
Even on the API, I can’t send it a voice stream yet.
The API refuses to generate images.
Next few weeks will tell as more people play with it.
There’s so much helpful niche functionality that can be added to custom clients.
I’m not a sceptic and apply AI on a daily basis, but the whole “we can finally replace people” vibe is extremely off-putting. I had very similar feelings during the pandemic, when the majority of people seemed so happy to drop any real human interaction in favor of remote comms via chats/audio calls. It still creeps me out how ready we are as a society to drop anything remotely human in favor of technocratic advancement and “productivity”.
On one hand, I agree - we shouldn't diminish the very real capabilities of these models with tech skepticism. On the other hand, I disagree - I believe this approach is unlikely to lead to human-level AGI.
Like so many things, the truth probably lies somewhere between the skeptical naysayers and the breathless fanboys.
You might not be fooled by a conversation with an agent like the one in the promo video, but you'd probably agree that somewhere around 80% of people could be. At what percentage would you say that it's good enough to be "human-level?"
They are referring to an AI that can use reasoning, deduction, logic, and abstraction like the smartest humans can, to discover, prove, and create novel things in every realm that humans can: math, physics, chemistry, biology, engineering, art, sociology, etc.
I think people will quickly learn with enough exposure, and then that percentage will go down.
The average human has tons of quirks, talks over others all the time, generally can't solve complex problems in a casual conversation setting, and is not always cheery and ready to please like Scarlett's character in Her.
I think our expectations of AI are way too high from our exposure to science fiction.
Also, if this is your definition of magic then...yeah...
the interruption part is just flow control at the edge. control-s, control-c stuff, right? not AI?
The sound of a female voice to an audience 85% composed of males between the ages of 14 and 55 is "magical", not this thing that recreates it.
so yeah, it's flow control and compression of highly curated, subtle soft porn. Subtle, hyper-targeted, subconscious porn honed by the most colossal digitally mediated focus group ever constructed to manipulate our (straight male) emotions.
why isn't the voice actually the voice of the pissed-off high school janitor telling you to man up and stop hyperventilating? instead it's a woman stroking your ego and telling you to relax and take deep breaths. what dataset did they train that voice on anyway?
Most voice assistants have male options, and an increasing number (including ChatGPT) have gender neutral voices.
> why isn't the voice actually the voice of the pissed off high school janitor telling you to man-up and stop hyperventilating
sounds like a great way to create a product people will outright hate
This is like horseshoe theory on steroids.