These demos show people talking to artificial intelligence. This is new. Humans are more partial to talking than writing. When people talk to each other (in person or over low-latency audio) there's a rich metadata channel of tone and timing, subtext, inexplicit knowledge. These videos seem to show the AI using this kind of metadata, in both input and output, and the conversation even flows reasonably well at times. I think this changes things a lot.
I'm also incredibly excited about the possibility of this as an always-available coding rubber duck. The multimodal demos really drove this home: collaboration with the model can basically be as seamless as screensharing with another person. Incredible.
I don't want to chat with computers to do basic things. I only want to chat with computers when the goal is to iterate on something. If the computer is too dumb to understand the request and needs to initiate iteration, I want no part.
(See also 'The Expanse' for how sci-fi imagined this properly.)
For me, this is seriously impressive, and I already use LLMs every day - but a real "Now we're talkin'" moment would be when I can stand outside a Lowe's and ask my glasses/earbuds, "Hey, I'm in front of Lowe's, where do I get my air filters?"
and it tells me whether they're in stock, plus the aisle and bay number. (If you can't tell, I am tired of fiddling with apps lol)
"Computer, buy some stock"
*** buys 100 lots of Tesla without a prompt ***
This is called an "employee", and all you need to do is pay them. If you don't want to do that, then I have to wonder: is what you want slavery?
Why shouldn't we expect AI to be created using the same type of math?
If there is a surprise, it's only that we can use the same math at a much higher level of abstraction than the quantum level.
If you give it access to the entire codebase at the same time that could work pretty well. Maybe even add an option to disable the sarcasm.
They did fuck all, especially the ginger.
Is that because you're not used to it? Honestly asking.
This is probably the first time it feels natural, whereas all our previous experiences with "chat bots", "automated phone systems", and "automated assistants" were absolutely terrible.
Naturally, we dislike it because "it's not human". That's true of pretty much anything that approaches the uncanny valley. But if the "not human" thing solves your problem 100% better/faster than the human counterpart, we tend to accept it a lot faster.
This is the first real contender. Siri was the "glimpse" and ChatGPT is probably the reality.
[EDIT]
https://vimeo.com/945587328 - the Khan Academy demo is nuts. The inflections are so good. It's pretty much right there in the uncanny valley: it still feels like you're talking to a robot, but you're also directly interacting with it. Crazy stuff.
That wasn't even my impression.
My impression was that it reminds me of the humans that I dislike.
It speaks in customer service voice. That faux friendly tone people use when they're trying to sell you something.
Really? I found this demo painful to watch and literally felt that "cringe" feeling. I showed it to my partner and she couldn't even stand to hear more than a sentence of the conversation before walking away.
It felt both staged and still frustrating to listen to.
And, like far too much in AI right now, a demo that will likely not pan out in practice.
Especially when you consider the bottom line: this tech will ultimately be shoehorned into advertising somehow (read: the field dedicated to manipulating you into buying shit).
This whole fucking thing bothers me.
This is partly right.
Agree. Can't wait to see how it'll be...
(Arguably, all things revolutionary do.)
I'm personally not very happy about this for a variety of reasons; nor am I saying AI is incapable of changing the entire human condition within our lifetimes. I do claim that we have little reason to believe we're headed in a more-utopian direction with AI.
Most people would never accept the same behavior from a being capable of more complex thoughts.
But of course this was the age-old debate with our favorite golden-eyed android; and unsurprisingly, he too received the same sort of animosity:
Bones was deeply skeptical when he first met Data: "I don't see no points on your ears, boy, but you sound like a Vulcan." And we all know how much he loved those green-blooded fools.
Likewise, Dr. Pulaski has since been criticized for her rude and dismissive attitude towards Data, which had flavors of what might even be considered "racism," or so goes the Trekverse discussion on the topic.
And let's of course not forget when he was put on trial, essentially over his "humanity" and whether he was indeed just the property of Starfleet, and nothing more.
The more recent Star Trek: Picard depicted an outright ban on "synthetics" and indeed their effective banishment; non-synthetic life - from human to Romulan - simply wasn't OK with them.
Yes, this is all science fiction silliness - or adoration, depending on your point of view - but I think it very much reflects the myriad directions our real-life world is going to scatter (shatter?) in the years ahead.
Sorry, had to be that trekkie :) and nice job referencing Measure of a Man — such great trek.
We get the upside of conversation, and avoid the downside of falling asleep at the wheel (as Ethan Mollick mentions in "Co-Intelligence".)
I was literally just thinking about this a few days ago... that we need a multi-modal language model with speech training built-in.
As soon as this thing rolls out, we'll be talking to language models like we talk to each other. Previously it was like dictating a letter and waiting for the responding letter to be read to you. Communication is possible, but not really in the way that we do it with humans.
This is MUCH more human-like, with the ability to interrupt each other and glean context clues from the full richness of the audio.
The model's ability to sing is really fascinating. Its ability to change the sound of its voice - its pacing, its pitch, its tonality. I don't know how they're controlling all of that via GPT-4o tokens, but this is much more interesting than what we had before.
I honestly don't fully understand the implications here.
Amazon, Google, and Apple have sunk literally billions of dollars into this idea only to find out that, no, we aren't.
We are with other humans, yes. When socialization is part of the conversation. When I'm talking to my local barista I'm not just ordering a coffee, I'm also maintaining a relationship with someone in my community.
But when it comes to work, writing >>> talking. Writing is clarity of ideas. Talking is cult of personality.
And when it comes to inputs/outputs, typing is more precise and more efficient.
Don't get me wrong, this is a revolutionary piece of technology. But I don't think the benefits of talking you're describing (timing, subtext, inexplicit knowledge) are achievable here either (for now), since even humans need hours of interaction over days/weeks/months of shared experience to achieve them with each other.
>>> But when it comes to work, writing >>> talking. Writing is clarity of ideas. Talking is cult of personality.
A lot of people think of their colleagues as part of a professional community as well, though.
For example, I mentioned something to my contractor and the short thing he said back and his tone had me assume he understood.
Oh, he absolutely did not.
And, with him at least, that doesn’t happen when in writing.
Is it so?
Speaking most of the time is for short exchange of information (pleasantries to essential information exchanges).
I prefer writing for long in-depth thought exchanges (whether by emails, blogs etc.)
In many cultures - European or Asian, people are not very loquacious in everyday life.
I'm 100% a text-everything, never-call person, but I can't live without Alexa these days; every time I'm in a hotel or on vacation I catch myself nearly asking a question out loud.
I also hate how much Alexa sucks, so this is a big deal. I spent years learning what it can and can't do, so it will be nice to have one that I don't have to treat like a toddler.
(We mostly use it in car trips -- great for keeping the kids (ages 8, 12) occupied with endless Harry Potter trivia questions, answers to science questions, etc.)
Besides - not sure if I want this level of immersion/fake when talking to a computer...
"Her" comes to mind pretty quickly…
If you don’t complete your thought in one go, you have to insert filler words to keep it listening.
I've long felt that embracing the concept of the 'prompt' was a terrible idea for Siri and all the other crappy voice assistants. They built ecosystems on top of this dumb reduction, which only engineers could have made: that _talking to someone_ is basically taking turns to compose a series of verbal audio snippets in a certain order.
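That "prompt" reduction can be sketched in a few lines. This is a toy simulation (all names and timings are illustrative, not any real assistant's API): the assistant treats any pause longer than a threshold as end-of-turn, so a speaker who pauses mid-thought gets cut off, which is exactly why people resort to filler words.

```python
# Toy model of turn-based voice input: collect words until the first
# pause longer than silence_timeout, then stop listening.
# (Hypothetical sketch; not how any particular assistant is implemented.)

def turn_based_listen(audio_events, silence_timeout=1.0):
    """audio_events: list of (word, seconds_of_silence_after) pairs."""
    heard = []
    for word, pause_after in audio_events:
        heard.append(word)
        if pause_after > silence_timeout:
            break  # end of "prompt" -- even if the speaker wasn't done
    return " ".join(heard)

# The speaker pauses to think after "to"; the assistant truncates there.
speech = [("remind", 0.2), ("me", 0.2), ("to", 2.5), ("call", 0.2), ("mom", 0.2)]
print(turn_based_listen(speech))  # -> "remind me to"
```

A full-duplex model sidesteps this entirely: there is no end-of-turn boundary to guess, because both sides can speak and listen at once.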
The previous ChatGPT app was getting pretty good once you learned the difference between letting sentences run together and breaking them up enough.
The tonality and inflections in the voice are a little too good.
Most people, taken on average across a spectrum, aren't that good at speaking and communicating, and that stands out as uncanny-valley territory. It is mindbogglingly good at it, though.
I don't think that's generally true, other than for socializing with other humans.
Note how people, now having a choice, prefer to text each other most of the time rather than voice call.
I don't think people sitting at work in their cube farm want to be talking to their computer either. The main use for voice would seem to be for occasional use talking to an assistant on a smartphone.
Maybe things will change in the future when we get to full human AGI level, treating the AGI as an equal, more as a person.
More on the IBM Personal Speech Assistant, for which I am on a (since expired) patent, by Liam Comerford: http://liamcomerford.com/alphamodels3.html

"The Personal Speech Assistant was a project aimed at bringing the spoken language user interface into the capabilities of hand held devices. David Nahamoo called a meeting among interested Research professionals, who decided that a PDA was the best existing target. I asked David to give me the Project Leader position, and he did. On this project I designed and wrote the Conversational Interface Manager and the initial set of user interface behaviors. I led the User Interface Design work, set specifications and approved the Industrial Design effort, and managed the team of local and offsite hardware and software contractors.

With the support of David Frank I interfaced it to a PC-based Palm Pilot emulator. David wrote the Palm Pilot applications and the PPOS extensions and tools needed to support input from an external process. Later, I worked with IBM Vimercati (Italy) to build several generations of processor cards for attachment to Palm Pilots. Paul Fernhout translated (and improved) my Python-based interface manager into C and ported it to the Vimercati coprocessor cards. Jan Sedivy's group in the Czech Republic ported the IBM speech recognizer to the coprocessor card. Paul, David and I collaborated on tools and refining the device operation.

I worked with the IBM Design Center (under Bob Steinbugler) to produce an industrial design. I ran acoustic performance tests on the candidate speakers and microphones using the initial plastic models they produced, and then farmed the design out to Insync Designs to reduce it to a manufacturable form. Insync had never made a functioning prototype, so I worked closely with them on Physical UI and assemblability issues. Their work was outstanding. By the end of the project I had assembled and distributed nearly 100 of these devices. These were given to senior management and to sales personnel."
Thanks for the fun/educational/interesting times, Liam!
As a bonus for that work, I had been offered one of the chessboards that been used when IBM Deep Blue defeated Garry Kasparov, but I turned it down as I did not want a symbol around of AI defeating humanity.
Twenty-five years later, how far that aspiration towards conversational speech with computers has come. Some ideas I've put together to help deal with the fallout: https://pdfernhout.net/beyond-a-jobless-recovery-knol.html "This article explores the issue of a "Jobless Recovery" mainly from a heterodox economic perspective. It emphasizes the implications of ideas by Marshall Brain and others that improvements in robotics, automation, design, and voluntary social networks are fundamentally changing the structure of the economic landscape. It outlines towards the end four major alternatives to mainstream economic practice (a basic income, a gift economy, stronger local subsistence economies, and resource-based planning). These alternatives could be used in combination to address what, even as far back as 1964, has been described as a breaking "income-through-jobs link". This link between jobs and income is breaking because of the declining value of most paid human labor relative to capital investments in automation and better design. Or, as is now the case, the value of paid human labor like at some newspapers or universities is also declining relative to the output of voluntary social networks such as for digital content production (like represented by this document). It is suggested that we will need to fundamentally reevaluate our economic theories and practices to adjust to these new realities emerging from exponential trends in technology and society."
Another idea for dealing with the consequences is using AI to facilitate Dialogue Mapping with IBIS for meetings to help small groups of people collaborate better on "wicked problems" like dealing with AI's pros and cons (like in this 2019 talk I gave at IBM's Cognitive Systems Institute Group). https://twitter.com/sumalaika/status/1153279423938007040
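For readers unfamiliar with IBIS: it structures a discussion as a tree of issues (questions), positions (candidate answers), and pro/con arguments. A minimal sketch, with illustrative node kinds and example content (not the schema of any particular Dialogue Mapping tool):

```python
# Minimal IBIS (Issue-Based Information System) tree, the structure
# behind Dialogue Mapping. Node kinds here are the conventional ones:
# "issue", "position", and "pro"/"con" arguments.
from dataclasses import dataclass, field

@dataclass
class Node:
    kind: str
    text: str
    children: list = field(default_factory=list)

    def add(self, kind, text):
        """Attach and return a child node."""
        child = Node(kind, text)
        self.children.append(child)
        return child

def outline(node, depth=0):
    """Render the map as an indented outline, one line per node."""
    lines = [f"{'  ' * depth}[{node.kind}] {node.text}"]
    for c in node.children:
        lines.extend(outline(c, depth + 1))
    return lines

# Example map (content is hypothetical):
issue = Node("issue", "How should we deploy conversational AI?")
pos = issue.add("position", "Restrict it to internal tools first")
pos.add("pro", "Limits harm while we learn")
pos.add("con", "Slows public benefit")
print("\n".join(outline(issue)))
```

The point of the structure is that every argument hangs off an explicit question, which is what keeps a group's discussion of a "wicked problem" from going in circles.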
Talk outline here: https://cognitive-science.info/wp-content/uploads/2019/07/CS...
A video of the presentation: https://cognitive-science.info/wp-content/uploads/2019/07/zo...
https://www.theonion.com/brain-dead-teen-only-capable-of-rol...
Yeah, and it's only the beginning.