I assume the goal of Alexa was never to be the top conversational system on the planet, it was to sell more stuff on Amazon. Apple's approach to making a friendly and helpful chat assistant helps keep people inside their ecosystem, but it's not clear how any skill beyond "Alexa, buy more soap" was going to contribute meaningfully to Alexa's success as a product from Amazon's perspective. I saw the part about them having a "how good at conversation is it" metric, but that cannot be the metric that leadership actually cared about, it was always going to be "how much stuff did we sell off Alexa". In other words, Amazon did not ever appear to be in the race to make the best voice assistant, and I'm not sure why they would want to be.
After years of raising 3 kids, you would think if I ask to add diapers to the cart, it would know something. But no, it would just go with whatever is the top recommended, or first in a search, or something like that. Nothing using the brand or most recent sizes we purchased.
There was no serious attempt to drive real commerce. Instead, Alexa became full of recommendation slots that PMs would battle over. "I set that timer for you. Do you want to try the Yoga skill?"
On the other hand, they have taken on messy problems and solved them well, though not with technology, and for no real financial gain. For example, if you ask for the score of the Tigers game, Alexa has to work out which "Tigers" you mean among teams in your own geography and worldwide, at every level from international to local, across every sport, any of which might have had a game of interest. People worked behind the scenes to manage this manually, tracking teams of interest and filling intent slots daily.
I'm actually working on an app that solves this for a specific use case, though it isn't in the retail space.
Voice assistants are particularly egregious - they've done all the work to correctly recognise the words I said - i.e. the hard part - but then the whole interaction breaks because I said "set reminder" instead of "create reminder"??
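A minimal sketch of why this happens (all intent names and phrases hypothetical): if intents are keyed on exact phrases, any unanticipated synonym falls through even when the speech recognition was perfect, and even a tiny verb-normalization pass rescues the obvious cases:

```python
# Hypothetical sketch of brittle intent routing: the recognizer returned
# the words perfectly, but the router only knows exact phrases.
INTENTS = {
    "create reminder": "ReminderIntent",
    "create alarm": "AlarmIntent",
}

def route_exact(utterance):
    return INTENTS.get(utterance.lower())

# One cheap fix: normalize synonymous verbs before the lookup.
VERB_SYNONYMS = {"set": "create", "make": "create", "add": "create"}

def route_normalized(utterance):
    words = utterance.lower().split()
    if words:
        words[0] = VERB_SYNONYMS.get(words[0], words[0])
    return INTENTS.get(" ".join(words))

# route_exact("set reminder") is None even though the intent is obvious;
# route_normalized("set reminder") finds "ReminderIntent".
```

The point isn't that synonym tables scale (they don't, which is the whole thread's theme), just that "understood every word, matched no intent" is a routing-layer failure, not a recognition one.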
You forgot the part about it solving crimes.
https://broward.us/2023/07/18/amazons-alexa-is-surprise-witn...
"Shopping with your voice" never took off despite many attempts. The contribution towards subscription services like Audible and Amazon Music was not substantial enough to warrant the massive R&D investment. The business unit never found any other sources of convincing revenue.
Every other decision is downstream from that unresolved tension.
I've never used our Alexa for shopping. If I said something like "Alexa, buy more filters", even being very clever and looking at my order history, it would still get something wrong. And then I'd need to use another device to actually make the order.
While it seems to work fine on the speech recognition part, in that Alexa understands the words I say, it never seemed good enough to actually navigate a task like ordering the right kind of filter.
I knew there was some behind-the-scenes scripting going on, but I didn't realize just how much...
We mostly use our Alexa for kitchen timers, reminders, and video calls with family. Occasionally for playing music too. No, I don't want to subscribe to Amazon Music Unlimited.
We're seeing this more and more in tech: Company comes out with a feature that few people want. It doesn't gain adoption. They make many attempts to cajole and nudge users to use the feature. Users don't use the feature. They make more buttons and flows trigger the feature. Users ignore them. They start tricking users into using the feature, with dark patterns and misleading buttons. Users deliberately learn and avoid these. Exasperated, they declare "Why, oh why, won't users just use this feature!? They're just uninformed or don't know what's good for them!"
Whatever happened to starting with what the user actually wants and then working backwards from that to the actual feature? More and more, companies are more interested in serving their own metrics than serving their users.
Guess which I picked?
Even when sitting in front of a real computer, it often takes a fair amount of effort to find a product that represents the kind of value I'm interested in at the moment.
Comparative shopping with this mess on the back end doesn't work with the current state of Alexa. There are details that are important to me, as a consumer, that can't be boiled down to a price and an 8-word summary.
If the back-end data weren't broken, buying with Alexa could be made to work if it could get a grasp (using ML or some other buzzword) of how a buyer's proclivities tended to be shaped. For instance, some people want the best per-volume price, and some others want the highest quality at any expense, with a huge range in between. I myself don't have a ton of room for bulk buying, so I often aim for a medium volume of moderate-high quality, tempered by a price that is low today.
But, again: The back-end data is broken, and Alexa is too stupid to make what I think are good decisions. When I can't trust the talking computer on my countertop to make good decisions for me, and if my hands are already full, I don't have time to have a drawn-out conversation with a bot, so I won't ever actually buy stuff that way.
It's not functionally better than Amazon's abortive Dash Buttons[0] from 8-ish years ago, which were also untrustworthy for many of the same (or related) reasons.
---
But if I'm cooking in the kitchen and I notice that I'm low on oregano, I do have time to say "Alexa, add oregano to my cart." And I'll also invariably make time to interrupt its misguided response with a quick "Alexa, shut the fuck up" once it starts prattling on about the useless summary from the bad back-end data (GIGO), so I can get back to doing what I'm doing.
This is important to mention because if I weren't already busy with my hands, I wouldn't bother with using Alexa at all for this task.
Eventually, I'll find myself in front of a real computer again and I'll go through and true up the things I've used Alexa to put in my cart, so they match my actual expectations, and actually buy some things. And while this is useful to me, it's obviously pretty far removed from the target goal of the system.
And it can't ever get better until they fix their data.
It’s painful to see them give up a good brand just at the moment when a change in technology could have given them wheels…
They cornered so many markets and, surprise, used that position to let everything go to shit for a profit. Still, at least Bezos got to wave his wang at the world by going to space.
I expect a similar thing to happen when AMZN announces some AI consumer product. Never mind they were in a Prime (ahahah - get it - "PRIME") position to be the first mover here.
An opportunity well and truly squandered.
As with other projects, Amazon’s plan seems to have been to get big fast and figure out monetization later. I’m sure ZIRP played some role in it, and if not for rate hikes they might have kept it going for a few more years.
But their aim from day 1 was to get millions of devices into customers' homes and then use that to boost e-commerce sales. When the second part didn’t materialize, the initiative suddenly became a white elephant, since it takes nontrivial server capacity to keep the backend infrastructure running.
Is that actually true? I cannot imagine that they are even marginally successful at that. In fact, I can’t identify what exactly Alexa succeeded at, beyond being a voice activated kitchen timer.
> that cannot be the metric that leadership actually cared about
I think the metric was promotions for Alexa employees, sort of like a lot of projects at Google.
Suppose I put a roast in the oven and retire to my office to do something completely unrelated to cooking, where I cannot hear what happens in the kitchen.
One would think that I could set a timer in the kitchen and have it notify me wherever I am -- in the office, in the living room, on my pocket computer, on my desktop PC, or maybe even all of these things.
"Alexa, set a timer for two hours and notify me everywhere" seems like a perfectly cromulent thing to do.
But it isn't that way. Timers follow Vegas rules: timers that start in the kitchen stay in the kitchen -- they cannot be heard anywhere else.
It's not superior in any functional way to the old dumb digital timer on my oven, which has a VFD and a rotary encoder to set a timer.
(Which, by the way, has really marvelous ramps and responsiveness for that encoder -- it's silly-fast and efficient to give that knob a twist and dial in exactly what I want for a timer. Adjusting the clock for DST or whatever is equally fast and straightforward.
Except, fucking perplexingly: Alexa can notify me in the office when my oven timer beeps in the kitchen. This works fine.
All that is clear is that there is nobody steering this fucking ship.)
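For what it's worth, the behavior being asked for is just a broadcast: the timer publishes its expiry to a bus, and every registered endpoint plays the alert. A toy sketch, with all the device names made up:

```python
# Hypothetical sketch: "notify me everywhere" as a broadcast bus.
# A timer publishes one event; every registered device receives it.
class NotificationBus:
    def __init__(self):
        self.devices = []

    def register(self, device):
        self.devices.append(device)

    def publish(self, message):
        # Fan the message out to every endpoint, kitchen or not.
        return [f"{device}: {message}" for device in self.devices]

bus = NotificationBus()
for device in ("kitchen_echo", "office_echo", "phone"):
    bus.register(device)

alerts = bus.publish("roast timer finished")
```

The Vegas-rules behavior is the degenerate case where only the originating device is ever registered.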
I got frustrated with that and tossed all my Alexas.
> "That did introduce tension for our team because we were supposed to be taking experimental bets for the platform’s future. These bets couldn’t be baked into product without hacks or shortcuts in the typical quarter as was the expectation."
If I can pump one learning into engineers' and PMs' heads it's this: intermediate deliverables are not optional no matter how cutting-edge your team is.
You will never succeed if your pitch to leadership is "give us a budget for the next N years and expect no shippable products until the end of N years". Even if you get approved somehow at the beginning, there's a 99.5% chance your team/project will be killed before you get to N years.
Again, once again for the audience in the back: there is no such thing as a multi-year project without convincing, meaningful intermediate deliverables.
To clarify, that doesn't mean "don't have multi-year roadmaps", it means "your multi-year roadmaps must deliver wins at a consistent cadence".
Understanding this will carry you a lot further in the industry.
As a fairly cutting-edge R&D team part of your job is to figure out what slice of this is shippable (and worth shipping). If you're coming up empty you are not ready to pitch this to execs.
If you push back in any way they start to scream "tech debt" and everyone just accepts it. I've been through a migration mandated by an infrastructure team where there were zero improvements for the teams that used the platform, all benefits were for the platform team only, and this was green-lighted and forced upon everyone without a second thought. It's unbelievable.
Just like real-life debt, building a company without it is unrealistic and unwise. You just have to manage it.
What you describe is exactly the opposite of research, which is mostly a matter of collecting never-ending failures.
An environment that lives by such logic cannot really produce major technological breakthroughs. And in fact, Amazon has very few of those to show compared to the rest of SV.
If you look at all the defining products of Apple, they also took years from the “germ of an idea” until they could be launched, and though they might have “shipped” internally, they gained a lot by not having pressure to ship things piecemeal to customers.
But it's not obvious to me that approach was even a net win for Google as a business. Did Google Brain invent the technology that killed Google? TBD I think.
Working on the latest and greatest social media website? Sure, ship early, ship often.
Working on medical devices? You better not ship a prototype.
Working on hardware? Too expensive to pivot from learnings, better get it right the first time.
Working for NASA? You better get it right the first time and predict all future issues that might be possible, and you better document it nine ways to Sunday.
This applies to ML as well - it applies to all tech projects, though yeah, it's harder in ML. But figuring out the intermediate products is not optional: your stuff will get killed prematurely if you don't.
The trick with ML is not to promise "98% precision and 92% recall by Q4", it's to figure out what kind of product is shippable with lower precision and recall. Or perhaps a stepping-stone model that allows some simpler use case, but gives you progress towards the greater goal.
It's always case-specific, but as a ML team you do need to figure out what your intermediate checkpoints are. You need to demonstrate not only progress, but that your progress is contributing to the company's goals.
Very experienced people tend to forget this from time to time too and get excited or convince themselves "big risk big reward"... I've never seen that work out.
Executive patience, focus, and planning horizons cover the immediately next 1-4 quarters, maybe years 2 and 3, perhaps year 5 if you are lucky, and that's it; the executives themselves might not even be around in five years.
In academia, if you are stubborn and tenured and don't care about your short term success (publications, citations, awards) you can actually decide to implement a very long-term vision, depending on how much additional funding you need (if that is a lot, you will need to also convince funding agencies or philanthropists of your vision).
Heck, even Andrew Wiles, someone who only needed pen and paper for research, had to publish papers during his 7 years working on Fermat's Last Theorem.
If your goal is only promotions inside of big tech then definitely throw most innovative ideas out the window. But if you're interested in innovation, then big tech either needs to get a lot more entrepreneurial or less big.
I wonder if these "SmartAssistant" programmers ever actually had a human personal assistant. For most of what you need them to do, you don't even ask them to do it, they just know you and do it. An actually good computerized SmartAssistant would know that it's been a year, so it's time to book my physical with my doctor. It would have contacted the doctor's office for me, checked my calendar, scheduled the appointment, and then proactively reminded me a few days in advance. I shouldn't have to say "Hey, Assistant: Please schedule a physical for Doctor X at Clinic Y on July 1 of this year." (by the way SmartAssistants can't even currently do that).
The voice interaction should only be for exceptional cases: "Hey, Assistant: My trip to the Paris office needs to be delayed by one week." The assistant should then go and re-book flights, hotels, and rental cars, and then when finished, merely say "Done."
Until they can do this, tech companies might as well stop bothering to release incremental crap products that can barely understand a task I'd expect a 4-year-old to be able to do.
Modern ML and embeddings models are the discontinuity that was needed to get from "massively complex hack that can't scale" to "even more complex but principled approach that scales pretty well".
Alexa's main failure was that the tech wasn't ready - it was basically ASR + NLU + a rule engine. If we had had 2023 LLM tech, we might have "won" the assistants market.
Yes, organizational bloat and politics was a problem but OP was hired as a result of the mass hiring spree, so he was a beneficiary of that.
Though I also very much agree with the other point of OP that privacy paranoia also blocked development. The privacy team seemed like they would have been most happy if we couldn't ship.
can confirm that several of my launches got delayed to bolt on GDPR
Mainly because when the org chart grows, more "rules" are added to the rules engine, with each rule managed by yet another service... which all adds to end-to-end user-perceived latency, etc. That's why rule engines don't work.
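A toy illustration of the additive-latency point (service names and numbers entirely made up): if every team's rule set lives behind its own service and they're consulted in sequence, each new service on the path adds its hop to the user-perceived total, whether or not it matches:

```python
# Hypothetical rule-service chain: (service, simulated latency in ms,
# trigger keywords). Each org adds its own service to the path.
RULE_SERVICES = [
    ("shopping_rules", 40, {"buy", "order"}),
    ("music_rules", 25, {"play", "song"}),
    ("smart_home_rules", 30, {"lights", "thermostat"}),
]

def evaluate(utterance):
    words = set(utterance.lower().split())
    total_ms = 0
    for service, latency_ms, keywords in RULE_SERVICES:
        total_ms += latency_ms  # every hop adds to perceived latency
        if words & keywords:
            return service, total_ms
    return None, total_ms  # walked the whole chain and found nothing
```

A request handled by the last service in the chain (or by none at all) pays for every hop before it, which is the latency-vs-functionality tension in miniature.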
In the early years I couldn’t control the Phillips hue lights in my home, and then one year suddenly I could thanks to updates.
Most companies would have abandoned hardware this old.
Also, they are trying to sell it as a subscription, which is interesting since Siri is free.
The Amazon philosophy of constant execution is at odds with large-leap technical innovation. It works very well for ops-heavy AWS orgs and supply-chain optimization problems. The company has a cultural problem.
Regardless of the above, ChatGPT made almost all NLP technologies across all companies obsolete.
If you would like to know more about elements just say "Alexa, tell me about the periodic table of elements".
What kind of assistant says "by the way" apropos of nothing? An annoying one. It's just a thinly disguised ad that was never asked for.
That would be a data point in favor of the amazon strategy, no? Prevented millions upon millions of being invested in developing losing technology.
The window to integrate LLMs (especially from Anthropic, in which Amazon is an investor) is closing but not shut.
If they can pull it off, they have massive distribution power to catch up and drive rapid adoption of an Alexa 2.0.
That's overstated, because you accidentally lumped speech recognition in, and I imagine Nuance (https://nuance.com) and others are like "hold my beer".
> Alexa put a huge emphasis on protecting customer data with guardrails in place to prevent leakage and access. Definitely a crucial practice, but one consequence was that the internal infrastructure for developers was agonizingly painful to work with.
I really don't want this to be a message companies are hearing right now -- that being conscientious about customer data is a lethal barrier to progress, in the "AI" gold rush.
Also, without knowing anything about the organization, I'd expect it to probably have a high level of dysfunction, being at a company known for being excessively metrics-driven from the top, and for ruthless stack-ranking and related HR practices... trying to organize a large coherent cutting-edge R&D effort against that cultural backdrop. Like suggested by this bit elsewhere in the section:
> And most importantly, there was no immediate story for the team’s PM to make a promotion case through fixing this issue other than “it’s scientifically the right thing to do and could lead to better models for some other team.” No incentive meant no action taken.
Companies that are absolutely at the forefront of AI must, by definition, be doing terrible things wrt privacy & security.
They won't be hearing it, they'll be (and are) sending it. "AI would be better for you if it had fewer guardrails around your privacy. Trust us."
Incremental improvement was rewarded through the regular stock and pay process.
Thus no one cared enough to quickly switch to LLMs.
Super interesting how org design, even when brilliant, can be severely lacking.
We use both Azure and AWS at my current org. We recently had an internal 'hackathon' to try LLMs in both clouds (Claude for AWS, GPT-4 for Azure) on our knowledge bases.
Clearly we couldn't differentiate them on response quality, not in 3 days, but on how easy the LLM was to integrate, AWS was superior, even for our mostly-Azure teams, weirdly.
It was great for her to play different radio stations, playlists, news. It did the job.
I did try linking it to a TV but that was terrible. Slow, janky, unreliable.
Since she died - Dec 2018 - "Alexa play LBC" "Alexa stop"
Oh, if you do have an Alexa device "Alexa, what noise does a hamster make?"
I just did this and it said "Here's a hippopotamus' grunt"
So that's how it's going for Alexa in my house
I was in Alexa and this rings painfully true. So many workarounds and endless classification escalations. The customer-data certified compute environments were extremely painful to use (though later improved but still annoying) and getting data in or out, even for anodyne reasons, was nigh impossible. For a long period, even getting access to this system (called Hoverboard) took months. During my internship I spent about half of it waiting for access to be granted and had to spend a big chunk of it testing out my training system on CPU...not fun.
This should be a given. The fact you think otherwise is worrying.
(Yes, data security legislation does introduce barriers. Tough. Get used to it.)
[1] https://www.aboutamazon.com/news/company-news/amazon-anthrop...
It’s obvious that current AI can handle context reasonably well, which is something the previous voice assistants failed badly at. The next thing is to write all the APIs so the AI can reasonably act on that context. It’s such a new way of doing things that it’s probably best to hard-cut development of the previous approach, which was seemingly hard-coded triggers->actions and always failed badly if you wanted to add context. E.g. “open the house blinds when I pick up the phone in the morning or when the alarm goes off, whichever comes first” would never work with the old voice assistants. It might just work with the new AI systems, but it’ll be a completely different system, not even a rewrite at that point.
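The "whichever comes first" part of that example is easy to state as code once you have an event stream: a one-shot trigger that fires on the first of several events and then disarms. A minimal sketch (event and action names hypothetical):

```python
# Hypothetical one-shot automation: fire on the FIRST of a set of events,
# then disarm, so the action runs at most once per arming.
class FirstOfTrigger:
    def __init__(self, events, action):
        self.events = set(events)
        self.action = action
        self.fired = False

    def on_event(self, event):
        if not self.fired and event in self.events:
            self.fired = True
            self.action()

log = []
blinds = FirstOfTrigger(
    {"phone_picked_up", "alarm_went_off"},
    lambda: log.append("open_blinds"),
)

blinds.on_event("alarm_went_off")   # first matching event: blinds open
blinds.on_event("phone_picked_up")  # already fired: ignored
```

The hard part the old assistants never solved wasn't this trigger logic; it was exposing events like "phone picked up" through any API at all.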
I suppose this might explain the Google -> Alphabet thing but they haven't really embraced the new corporate name enough for "Alphabet Intelligence" to make any sense.
The only way to make things better (in my mind) was to use my own time to improve the infra, and because the metrics don't track these infra improvements I don't get rewarded so I just became burned out.
Part of me think this is the reason why you want bloat in orgs, so that motivated people with enough redundancy will actually feel comfortable chasing longer term incentives.
I've stopped using home devices like the Echo (coz of privacy concerns, esp with hotword mistriggers): now use voice only when driving the car. Maybe multimodal LLMs like GPT-4o will spawn new useful use-cases, but I think they're unlikely to be for the same use-cases Alexa the product+brand is known for.
1. Set my alarm/timer.
2. "What's the weather?"
3. "Turn on/off my lights" for those with connected lights.
etc.
We've had voice calling for over a century, yet it feels like the majority of us prefer to text most of the time these days.
It depends what culture you’re from. Many cultures around the world prefer voice. If you live in a fairly large city, just look around for folks on the phone. There are still many people who need a voice plan.
They're annoying to use, because the interface sort of implies affordances (like, you know, just talking to it like a person) that aren't actually available, and really it's just a menu tree that's barely more sophisticated than a customer support call tree.
I unplugged it and am not too sure about plugging it back in.
I probably could have used the app to stop it but I didn't think about that at the time.
One last failure for all of Big Tech, where the desire to maintain control prevented any form of standardization or interoperability, to the point where hobbyist open-source solutions are now leading the way on how to do the smart home right and not abandon the user base 6 months after release.
I really want HomePod to be better at household tasks such as managing shopping lists, timers, and reminders, but it's not there yet. As soon as the HomePod can replace my Alexa devices, I'll be all in. I have a HomePod right next to every Alexa device in my house, and I'm just waiting for Apple to turn on their "Apple intelligence."
I honestly ask this because I never tried though… I use my homepod as a glorified timer, alarm clock, and speaker. I’m just sitting here in the apple ecosystem hoping one day things will actually feel connected.
You live in a bubble
Can we give these things their real name: Smart Microphones
Alexa = Amazon's microphone.
Like most people, I use Alexa for _commands_: home automation, timers, tell me the weather, ask a specific question looking for a specific answer, play this music. That's not "conversational", and I don't want it to be.
I use generative AI for other things, mostly writing code for me, or telling me about code problems in general. It's rare that I want output that I'm _not_ going to copy/paste somewhere.
Alexa isn't a failure, it just didn't sell more stuff for Amazon. And, well, it costs an awful lot for them to keep running. So maybe it is.
It definitely captured the market, but without a top down vision, the whole thing was just a huge letdown.
I was always under the impression that Amazon uploads all our data, because I notice data transfers whenever I use voice commands, which makes me doubt their privacy claims.
It seems like Alexa was designed more to learn from us rather than to genuinely assist us. Its primary goal appears to be gathering data rather than helping users.
As I recall, they said as much. The device uploads a clip of the audio to get processed by the back end, does it not?
For what it’s worth, we were working on a conversational health app and this is why we picked Alexa over alternatives (if you’re big enough to get on GPT enterprise you can probably implement HIPAA safeguards, but we never got replies).
Most of the queries are going to involve setting an alarm or turning a thing on/off.
They didn’t drop the ball; they were very customer-savvy and really knew what they were getting into.
Up until about a year ago you could also do it on the computer but they took it down. https://alexa.amazon.com/
When Siri came out in 2011 -- two years before Alexa -- all my coworkers and I had iPhones. I remember sitting in my office as people yelled at Siri all day trying to get her to be useful. "Hey Siri, what's the weather tomorrow? No... No SIRI, WHAT'S -- THE -- WEATHER -- TOMORROW!"
Even though it sucked, it seemed every hardcore Apple user was ready to jump onboard. Who cares if I'm in a crowded office with people trying to get work done while I spend 10x longer to perform a function in the noisiest possible way? I'm using this thing!!
The voice recognition has improved since then. But the functionality still sucks.
When I'm in private, there are a couple commands I'll use.
- "Hey Siri, call xyz" where xyz is someone in my contact list I have tested with Siri and is known to work. Not recommended to try without testing first.
- While cooking, "Hey Siri, set a timer for 10 minutes." Works great.
- While driving and navigating: "Hey Siri, take me to the nearest gas station." That one is pretty good, except the actual maps are not smart enough so sometimes you'll be turned around in the opposite direction you were going, since technically that's where the nearest gas station is.
I never understood why they couldn't make this tool better, even before LLMs and without any AI at all. Just hard-code a bunch of phrases, and ways to translate those phrases into some action.
"Hey Siri, how close is my UPS delivery?"
"Hey Siri, where can I get the best price on xyz cat food?"
"Hey Siri, what's my bank balance?"
"Hey Siri, how much is a Lyft to xyz?"
I bet if they had a single developer working on adding Siri commands full-time, they could announce something like 20-50 new Siri functions at every WWDC.
But it seems the goal now is just "Make it an LLM," instead of focusing on recognizing the task that the user wants to do, and connecting it to APIs that can do those tasks.
They could've dominated the "conversational system" market 13 years ago.
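The "hard-code a bunch of phrases" approach described above can be sketched as a pattern-to-handler dispatch table; the handlers here are stubs standing in for whatever real API each command would call, and everything in this sketch is hypothetical:

```python
import re

# Stub handlers standing in for real backend integrations.
def ups_delivery_eta():
    return "tracking_lookup"

def price_check(item):
    return f"price_search:{item}"

# The dispatch table: each new voice command is one more (pattern, handler)
# row -- exactly the "one developer adds N commands per year" model.
COMMANDS = [
    (re.compile(r"how close is my ups delivery", re.I),
     lambda m: ups_delivery_eta()),
    (re.compile(r"best price on ([\w ]+)", re.I),
     lambda m: price_check(m.group(1))),
]

def handle(utterance):
    for pattern, handler in COMMANDS:
        m = pattern.search(utterance)
        if m:
            return handler(m)
    return None  # unrecognized: fall back to "Sorry, I didn't get that"
```

This is the approach that works fine for dozens of commands and collapses at the long tail, which is where the LLM-vs-rules argument in the rest of the thread comes in.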
I almost completely agreed with you, but this is not true! Apple is trying to solve the task & API problem with "App Intents", on which they go into more detail outside of the keynote: https://youtu.be/Lb89T7ybCBE
The new Siri models are trained on a large number of schemas. Apps can implement those schemas to say “I provide this action” (aka, the user intends to do this action). Siri can use the more advanced NLP that comes with GenAI to match what you say to a schema, and send that to an app.
These app intents are also available to spotlight and shortcuts, making them more powerful than just being Siri actions
My opinion is that data access restrictions did not cause Alexa to fail. If you think about it, it wasn't lack of machine learning that contributed to its issues. Alexa attempted to solve the long tail of customer requests with the equivalent of spaghetti "if statements" - rule engines. This was never going to scale. Alexa did not have a generic enough approach to cover the long tail of customer requests (e.g. AGI). With rule engines, there was always a tension between latency and functionality. Alexa solved this with bureaucracy - monitor latency, monitor customer request types, and make business decisions about how to evolve the rule engines. But it was never fundamentally able to scale out of the most basic requests or solve chicken-egg problems (customers don't ask complicated requests because Alexa isn't capable, so they don't show up as large enough use cases to optimize for). Top use cases remained playing music and setting timers.
A more fundamental issue was monetizing. Early on Bezos liked the idea of having a small, essentially free, device that would reduce the friction to buying things. If you remember the "easy buttons" Amazon floated there were many ideas like this. In practice, building a robust voice assistant that could purchase items proved challenging for a myriad of reasons. So the business looked for other ways to monetize. Advertising kept coming up but there was rank and file pushback to this because it could break customer expectations and/or privacy concerns. Alexa considered pivoting into various B2B ventures (hospitality, healthcare, business) and other customer scenarios (smarthome, automotive) but took half-measures into each of them rather than committing to an opportunity. It felt like a solution looking for a problem.
Alexa would have (could still?) benefit from modern LLM technology. However to be truly useful it would need to do more than chat. It would need some layer to take actions. This would all have to be carefully considered and designed so that it scales - so that it isn't a bureaucracy trying to measure what people are wanting to do and "if statement"ing a rules engine to enable it. OpenAI and others appear to be poised with the machine learning expertise to do this.
Finally, it's my opinion that Alexa's machine learning scientists were very good; however, as a population they did not appear to me to really care about the business/product use case. Many of them worked on research for publication on problems like distance estimation, etc. The expertise was very heavy on voice transcription and audio processing, with much less expertise in "reasoning". This, I hypothesize, contributed to the approach of iterated rules engines, with the science community focused primarily on improving transcription accuracy by small numbers of basis points.