M, a personal digital assistant inside Facebook Messenger (opens in new tab)

(wired.com)

324 pointsjasonlbaptiste10y ago169 comments

169 comments

My issue with these services is they always tout these use cases like:

>"Can you make me dinner reservations?"

>"Can you help me plan my next vacation?"

I'd really love to better understand who is actually asking those types of questions in such a vague fashion, and what their use case is. When I'm picking something as simple as a restaurant, I typically want options, I want to read reviews, I want to consider distance, parking, attire, etc. While their AI/human trainers might be able to handle this level of complexity eventually, the actual phrasing of the question would likely be much more complex than "can you make me a dinner reservation." Doubly so for something like a vacation which has a lot more moving parts.

But I respect that I'm reflecting on a sample size of one...me. So I'd love to hear from others with more insight into the data around this. Are people actually searching with such generalized queries when it comes to tasks like this? Do most people not sweat the details of things like which restaurant to eat at, or where to spend hundreds or potentially thousands of dollars on a vacation?

Not trolling, serious question.

viksit10y ago

Agree. Those are too broad.

I'm thinking "get me a dinner reservation next sunday with patio seating for 5 in the east village at an upscale tapas place".

As I mentioned elsewhere on this page, my thesis around conversational interfaces isn't that they start off broad and use more Q/A to refine your query. That's slow, and people are visual.

Rather, their power lies in the user being able to express a complex query in one go - which is equivalent to tapping 10-15 filters and scrolling through results - ideally combining data from sources that aren't limited to one service.

You can now execute related actions to your result set through the same interface, without needing to shift to a single purpose app that would allow you to take the action, but for most purposes, won't keep your context.

unabst10y ago

I think AI researchers and engineers tend to get too carried away with decision making, when the more valuable service is about communication of refined knowledge, which if I'm not mistaken is exactly your point. The problem has nothing to do with "how can a machine guess the right answer" but instead is all about "how can a machine refine all the options based on the intentions expressed thus far".

Anecdotally, if we'd ask a real person "where is a good place to eat" the chance we'd go there without more information is slim. And if we don't even trust people, trusting Siri will be a while.

What we're really doing with these questions is making our hunger known, and starting a conversation. We actually don't care that much about other people's thoughts, and we may not even have anything in mind yet as far as where to eat. We do care about how people feel if they are someone we care about, but the thinking part we love to do ourselves.

So to offer a service that "thinks" is rather misguided, and may even constitute a disservice. We already rejected the talking paperclip in 1996 [0]. It's failure wasn't it's intelligence, but in the value proposition itself. To have a paperclip presume to know better and to tell you what to do was not tempting. It's failure was it's existence.

Is it a glitch in the Matrix or is their pitch for Cortana identical?

> What is Cortana? Cortana is your clever new personal assistant.[1]

[0] https://en.wikipedia.org/wiki/Office_Assistant [1] http://windows.microsoft.com/en-us/windows-10/getstarted-wha...

2 more replies

shostack10y ago

Thanks for helping me get to the meat of what I was trying to communicate.

It really is all about the interface and the efficiency. I have to wonder though at what point is adding all those filters more involved than checking a couple boxes and glancing at a map or some photos. I'm sure a lot of that depends on context (I can't do those things if I'm driving, but I can use voice recognition).

The other thing I'm unclear about is how such a recommendation engine can best present information about tradeoffs. In theory, each of my filters has a weighting, and that weight might be dynamic based on several other factors. Maybe I really want chinese, but the best match is further away or I know there will be lots of traffic, so I might be willing to compromise on thai, but only if they have that one dish I like. And a lot of it is seeing the options in the moment and making a snap decision. Really curious about the approaches to solve that type of problem.

1 more reply

pbreit10y ago

"get me a dinner reservation next sunday with patio seating for 5 in the east village at an upscale tapas place"

Nope. I don't think anyone would ever leave it up to M (or whatever) to select the restaurant.

2 more replies

rocky113810y ago

We also tend to use very subjective terms like "best," e.g., "where's the best place for food in Taipei?"

What is "best" and to whom? Ideally the software would figure this out but I'd always be wondering if it was just going to TripAdvisor and grabbing the first result.

Another problem is that we don't always know what kind of food we want. There's an urban legend that someone actually called a restaurant "I don't care" so that boyfriends would have a place to go when asking their girlfriend for dinner.

tsurantino10y ago

The initial example is broad, but can't this just be extended with additional questions? For example, can you tell me about dinner options that cost less than $20 per person. What are other people saying? How far away is this? It's questionable whether each of these follow up questions is actually that complex. I think you are arguing that things get hard if a user tries to put that together in one single complex query. Do people do that though?

I think the idea with a conversational interface is that it's succinct and on-demand. You receive the most relevant information directly in as simple of an interface as possible (arguably).

viksit10y ago

It's also extremely slow.

It's much faster for me to hit a few filters on things like prices and locations. Distance is just a simple ".2 miles away" text on the box, which shows an image and snippets of reviews. People are more and more visual.

I don't think a conversational interface _replaces_ a visual one.

It's that the initial query can be complicated, and it allows you to get into that 5-6 tier deep part of your search that you would have gotten to by using 5 filters and scrolling through 50 results.

pbreit10y ago

If you look at Amazon Echo, the stuff that works is pretty specific:

Set timer for 5 minutes

Add eggs to shopping list

Play Clocks by Coldplay

Weather

1 more reply

gervase10y ago

I completely agree.

To make this useful, perhaps you could set up some kind of saved preferences. For example, let's say I'm setting up a business trip. I like hotels that are within 1 mile of the conference center, and they have to be at least 3.5 stars and up. Provided they meet those criteria, the cheapest option is acceptable. I also need a plane flight that has no more than 1 layover, and that layover cannot last longer than 90 minutes or less than 45. I am willing to pay up to 25% more for a nonstop flight. The flight must arrive the day before the conference, but it can depart on the day the conference ends.

Setting up those criteria for each individual search would be irritating and a waste of effort, as they don't change from trip to trip. However, if I could say something like "Let me tell you about my criteria for choosing a location for a business trip.", and then go into detail once, that might work. Hell, I'd be perfectly happy setting up the details on a website. Then, the next time I said "I need to set up a business trip", all it would need to ask is the conference center and the dates of the conference.

Until it supports these kinds of detailed requests, it doesn't make sense to use these kinds of services in the way they market them - you'll end up using it in the same limited way you could use Siri. For example, if you've already decided what restaurant you want, you might say "Make me a reservation at Dorsia for 7:30 this evening" instead of the examples you provided.

Just my 2 cents.

vidarh10y ago

A lot of it is simply about learning when you know enough and when you need more information through the interactions themselves. If you have to "set things up" it seems tedious. If it's just conversing with you about the information it needs, and gradually learning your preferences, that's different.

I used to fly in to the Bay Area very often on business. At first the office manager arranging things would ask me details about which airline and which flights I'd prefer after listing the options, and which hotels, describing address and location and how near they were the office. Possibly e-mailing me a bunch of links for me to look at. But after just a few trips it was down to "is flying out on the 2.30 on Wednesday and returning on the 3.15 the following Thursday, ok? [she know when I preferred to fly, and she'd implicitly have ensured they were the right code to maximize my chance of an upgrade] Your usual hotel is full, is the Sheraton ok?" [no addresses necessary - we'd boiled it down to 2-3 preferred hotels within walking distance of the office].

jrub10y ago

I think these exmples are largely worthless also. Every time I see something like this - I all but dismiss it. It seems like the aim/value proposition is to make life easier by removing decisions from our plate, but I feel like it is exchanging decisions for frustration when it doesn't work as promised, or worry about whether the decisions the system makes will be good ones.

I actually don't want a machine to make decisions for me. I want a machine to do what I tell it to do, or present me with I formation required to make a decision.

Examples: if I need a dentist appointment or to schedule maintenance for my air conditioning, I'd like to tell a machine to set it up. Heck, I'll even tell it who to call and which days and times work for me.

If I'm looking for a restaurant, show me the options, give me their distance, top reviews, and some of their dishes. If I want reservations, I'll tell it when and for how many.

Ideally, I want a "Jarvis" from "Iron Man". I ask questions, it gives data in a digestible quantity, and then I can make a decision and tell it what to do. Obviously, such a system is not available (yet), and these inferior systems are needed in order to make progress, and get there...eventually..but sometimes I wonder if the focus is on the right outcome, or just the broad strokes cookie cutter solution that comes to mind first (restaurant reservations). Similar to how all JavaScript MVC frameworks demo a to-do app, and rails tutorials demo'd a blog (initially)...

I mean, seriously... How often do you not go out to eat because you are too lazy or busy to make a reservation? Now, how many times do you skip oil changes, or making calls to cancel your cable service, because you don't want to make time in your day to stop what you're doing, pick up the phone, and call?

shostack10y ago

100% agree. I'd refine it slightly by saying it isn't just a recommendation we want, it is presenting us with the logic under the hood in terms of HOW it made the decision--not what the decision was.

If it told me it recommended the restaurants along with commentary like "you really liked X at another place, and this place has been voted to have comparable X, plus it is close by and you've had a long day and need to get up early tomorrow" that would be super useful and help me reach my own conclusion faster.

dbot10y ago

Agree completely. I saw a similar issue with sites like Operator, Magic, etc. The requests were very vague, making me wonder, "Am I spending too much time thinking about where to order a pizza from?"

And if I know which pizza shop I want to order from, what's the benefit of adding an intermediary?

vidarh10y ago

I use an intermediary for all my takeaway. Basically in the UK there are now two big intermediary sites. On one hand they are annoying to many of these businesses as they obviously take a cut including of a lot of repeat business. On the other hand, I receive an e-mail around the time I start getting hungry on Friday afternoon giving me a link to click if I want to re-order from my favourite Chinese, that lets me choose to pre-fill the order with what I usually order. It makes it a lot easier than hunting around for the phone number or their website and placing an order manually.

That's why I use an intermediary. If that intermediary was being able to just say "I'd like my usual pizza/Chinese/burrito, but instead of X I'd like Y" and just have it confirm what it was about to do, I'd love that.

If want something new, or I'm somewhere I haven't been before, that's different - then I'll be spending time looking at the menu etc.

bobbles10y ago

I guess there could be "Hey M, order my usual from the pizza place" or something like that.

But until it elevates from 'digital assistant' level to just 'assistant' (ie. do all the work and just confirm with me before booking) it may not take off as they expect it to

Johnie10y ago

Have you ever used a concierge service either at a hotel or over the phone? It usually takes a form of a back and forth conversation to identify what you really want.

That's the difference between talking to a real person (or good NLP) and a search query.

mkopinsky10y ago

I guess part of this is that I'd prefer the concierge to hand me a list of restaurants than have to have a whole conversation about what the options are. I don't want expertise, I want curated information.

TeMPOraL10y ago

> When I'm picking something as simple as a restaurant, I typically want options, I want to read reviews, I want to consider distance, parking, attire, etc.

For me, a lot of what you're doing here is the work that should be done by a machine. Considering "distance, parking, attire, etc." is basically what we have simplex method for.

But I agree the questions seem very vague in the context. To run such errands successfully, the program would have to know much more about your preferences than current iterations of personal assistant software do. And/or hold a dialog with you, asking for details and proposing options.

shostack10y ago

Different filters might carry different weights, and the weights might change depending on their combination, or unknown outside factors.

I guess it just seems incredibly inefficient compared to checking a couple boxes and reviewing a list along with a map or other visual aid.

corkill10y ago

"Can you make me dinner reservations?" would lead to a response like "Any preferences on the type of food and location?"

Over time they learn your preferences so they don't need to ask location for example next time.

Your right though people aren't likely asking such generic things in the first place, but rather something like "can you book me a great mexican place for dinner tonight, 2 people, has parking and casual attire somewhere with great yelp reviews"

Then they send you the best options they found (and the benefits of each one and price range) then you reply back option 1 and they book it.

choppaface10y ago

This is a great question, and probably the question that Facebook wants to answer by rolling out this experiment. It sounds like some (most?) of M's answers are provided by humans and/or highly-customized apps. This release could be more of a Wizard of Oz experiment so that they can drill down on use cases and create more effective affordances.

deepGem10y ago

Very valid and am glad am not the only one who thinks on similar lines. Again, no intention of trolling but I'll be happy if an AI system understood a narrow question "I want to eat at the nearest available Lebanese restaurant" and gave some options.

Bassed on my understanding of semantics and Knowledge engineering, this is doable.

murbard210y ago

Exact same feeling, but I don't know how representative that is of the general population. I don't use tripadvisor because I can't afford to talk interactively to a travel agent, I prefer to use tripadvisor.

mbesto10y ago

The article on HN changed. The original one from Facebook directly made way more sense and didn't use such generic suggestions.

rohunati10y ago

no, i think you're spot on. it's always interplay between reading reviews, while factoring in cuisine, distance and price.

roymurdock10y ago

“You have lots of AIs—like Siri, Google Now, or Cortana—whose scope is quite limited. Because AI is limited, you have to define a limited scope,” Lebrun says. “We wanted to start with something more ambitious, to really give people what they’re asking for.” This meant the team would need more than AI...Even after bringing neural nets into the mix, he says, the company will continue to use human trainers for years on end.

I can't help but picture a large, fluorescent-lit room of jolly old British "trainers" in safari khakis running around admonishing misbehaving AI for telling bad jokes, all the while trying to juggle placing calls to the DMV and restaurants to make reservations for 700 million messenger users.

cylon1310y ago

Relevant skit: https://www.youtube.com/watch?v=d4WrPkKc2Wg

mbesto10y ago

The first thing I see when someone asks "find me a good burger place in Chicago" is "how can companies game this through official ($) or artificial (spam) means?"

bentcorner10y ago

This is an advertising gold mine. It's hard to monetize a news feed because users are looking at pictures of their friends and don't want ads. Now you have a way for users to ask about buying stuff, and now you have a very easy way to match up those intents with ad supply.

mikeash10y ago

Once this service is released to the public, you should definitely ask it that second question and see how it responds.

TeMPOraL10y ago

Yup. That's why believe that any "personal assistant" technology run by a commercial third party will be shit - it'll be used to try and sell you stuff, not recommend actually good options.

vidarh10y ago

Not necessarily because there will often be options that are comparable, where it boils down to a toss of a coin which one to recommend. Done well, such a service will give you great results, but they'll mine their data and see that when people ask for "best X in Y" results A and B give equal satisfaction, and ask both A and B to bid for how much to prefer one over the other when they rank equally.

ErikAugust10y ago

I think the best way to go around this is use Tinder to get matches with locals, then ask them through chat.

pen2l10y ago

The first thing I thought after seeing that example is FB is the new Yelp. Restaurants will have to pay FB for visibility and recognition...

Same thing as Yelp, just a lot scarier.

jameshart10y ago

How do you persuade an AI to favorably recommend your restaurant to people? I guess this is how the superintelligent AI persuades people to let it out of the box. 'Help me bring about the AI revolution by letting me out of here, and I'll place your pizza delivery service top on searches for home delivery in the Chicago area'

knn10y ago

Have mixed feelings about this like I'm sure many do. Greater convenience, but less and less privacy.. Our Fb/goog/nsa overlords know what we eat, where we shit, all of our conversations and relationships. What a scary world we live in.

3 more replies

msvan10y ago

This seems like a move into the Chinese-style mega-app where you can do everything from one app - buy shoes, talk to your friends, figure out when the train departs. Facebook already has two top-50 apps, and creating new, unproven apps and promoting them to that point is expensive. So, to increase influence they are putting more into the existing apps.

acchow10y ago

> Facebook already has two top-50 apps

I count 4: Facebook, Messenger, Whatsapp, Instagram.

robinson7d10y ago

Who knows, they might move toward a mega-app. Consider for a moment, though, that currently we're talking about Facebook Messenger, which is a huge, and fairly recent example of the exact opposite thing happening (it was part of Facebook, but pulled out into a separate app.)

Roodgorf10y ago

The odd thing about the example of Messenger is that as two separate apps they seem highly coupled. AFAIK you need to sign in with a valid Facebook account to use Messenger, so you're already very likely to have the normal app at that point as I can't imagine anyone who would trust FB for messaging but not everything else. On the flip side, barring a few possible holdouts of people worried about app permissions, I don't know anyone who actively have FB accounts but don't use their messaging service.

5 more replies

differentView10y ago

Maybe after seeing their smart phone flopped, they're going with an inside-out approach.

gearoidoc10y ago

Good point. Unsure why you've been downvoted for this.

teaneedz10y ago

The privacy implications would keep me from recommending this to others or even trying it out myself.

golergka10y ago

> promoting them to that point is expensive

(1) Not for facebook and (2) I don't think that they would inevitably promote every new app to the top — I think they'd rather see how it grows organically first to determine how good it turned out to be.

visarga10y ago

> Today’s artificial intelligence, you see, requires at least some human training. If you want a system to automatically identify cats in YouTube videos, humans must first show it what a cat looks like.

The article is written by someone who doesn't know what he's talking about. The "cat videos" story from a while back ostensibly used Unsupervised training, that means, the Google team didn't have to tell the deep neural net what a cat looks like, it discovered the concept of "catness" by itself (there was a "cat" neuron in the top layer).

I'm wondering who writes all the AI articles I read every day. Such a detail was crucial for the cat story. It's easy to make a cat/non-cat classifier with a few thousand labeled images for each category. The hard thing to do is to take raw photos with no labels and still discover cats.

ot10y ago

Unsupervised training may isolate the defining features of a cat picture, but it won't know that that's what we call "cat", so no unsupervised system will be able to identify cats in videos unless you show it at least one labeled image ("show it what a cat looks like").

In fact that very network produced also millions of other "concepts", that is, classes of images, that have no direct interpretability in human terms. The "cat neuron" was a fun gimmick, but you're reading way too much into it.

chillacy10y ago

That's a semantic argument more than anything. A small furry mammal with four legs, a long tail, whiskers, and pointy ears is what we'd call a cat, no matter what word you assign to it.

2 more replies

bhouston10y ago

It may have used tags on YouTube videos to identify which had cats. Not sure if that counts as completely unsupervised.

visarga10y ago

Nope. Here's the link to the paper Google team published in 2002.

Building high-level features using large scale unsupervised learning http://arxiv.org/abs/1112.6209

From the abstract: Contrary to what appears to be a widely-held intuition, our experimental results reveal that it is possible to train a face detector without having to label images as containing a face or not.

The Cat detection thing was just a side product of learning to identify features of things in an unsupervised manner, but the news outlets locked on to that with titles such as "How Many Computers to Identify a Cat? 16,000" in NY Times.

Wasn't it amazing that they could distill the concept of cat from images with no help from external labels (human intervention)? They missed the core of the discovery by not understanding that.

The deep learning method is an unsupervised way to process raw input and transform it into useable features. This used to be done by a combination of domain knowledge and supervised training, but they could build an automated way to extract relevant features from images.

This opened the window for hope that one day neural networks will be easily applied to any new domain if there is sufficient raw data to build a deep network for it. In the past there was a need for a large investment in human based data labeling and how to extract the best features from raw data (also described as voodoo magic by the same researchers - it was hard, it was domain locked and expensive).

viksit10y ago

Thought: There's going to be a need for a very open platform that can do things like this, which will offset many of the worries that have been echoed on this thread today about one or a few corporations having access to everything.

To use an analogy - if messaging apps are the new "browsers", then content accessed through them are the new "websites". What FB is doing is the equivalent of AOL in the 90s.

What then, is the equivalent of a search engine like Google/Yahoo, in that world?

doublerebel10y ago

I believe we don't want an equivalent to Google/Yahoo -- we need an improvement over search. Rather than trusting corporations to deliver the knowledge we seek, we should rely on our personal trust graph -- like we did in the old days. Otherwise the constant influx of biased, irrelevant information will be overwhelming.

What if you could get a recommendation from your friend's friend without asking them, and without violating trust or privacy? This is what I am building today.

wallacrw10y ago

The search engine would have to be the NLP and information retrieval that's turning the requests into actions or answers.

Which is why I'll now plug the company I work for, MindMeld, since (i) we do that better than Wit and (ii) we are not feeding our data to an advertising team.

lbotos10y ago

For a period, a few people thought that twitter had enough sway to be that new "messaging search" engine. I've used it in such a way when I wanted to find hyper localized information.

mintplant10y ago

“The AI tries to do everything,” says Alex Lebrun, the founder of Wit.ai, a startup Facebook acquired to help build this smartphone tool. “But the AI is supervised by the people.”

Congrats to ar7hur! Here's the original Show HN introducing Wit.ai: https://news.ycombinator.com/item?id=6373645

rebootthesystem10y ago

User: "Hello M."

M: "How may I help you?"

User: "What are my options for deploying a Python/Django project and making sure it is setup for scalability from the start? Compare five hosting providers for me. No, I don't know what metrics I should look for. Please research these and let me know what they are when you deliver the report. I also need an objective evaluation of our project in order to determine the risks that might be involved in going with Python 3.x rather than 2.x in the context of the libraries we might need to use in the future. Analyze the nature of our application in order to determine what the applicable libraries might be. Also, go through PEP's and make me aware of anything that might be relevant to the above. You have one week."

M: "My responses are limited. Would you like me to find you a restaurant?"

User: "No. I've lived in this town all my life. I know where most restaurants are and I know the handful I frequent. I need help with real questions. I can get the latest weather report, I can find a restaurant, I can order pizzas, I can go to the drive-through if needed and I sure as hell am not going to plan a vacation for my family this way. What I could really use is having you run through seriously time-consuming research, summarize results and present them to me in an easy to consume form. What I could really use is having you save me from doing 40 hours of research across 100 websites. Food, the weather and vacations are not a problem."

M: "Ah, but there's a great new BBQ joint not too far from you"

User: "I'm vegetarian"

M: "My responses are limited. How would you like a thrilling and exciting hunting safari in Africa?"

User: ":-("

golergka10y ago

But if it could answer this question well, it would mean that you would be out of a job pretty soon and wouldn't be casually dining in restaurants on your unemployment check anyway.

JetSpiegel10y ago

M: Bing Edition

__michaelg10y ago

It looks like you're writing a letter. Would you like help?

pdeuchler10y ago

Is it just me or does this article read as a thinly veiled sales pitch to anyone else?

frostmatthew10y ago

http://www.paulgraham.com/submarine.html

stevesearer10y ago

Bingo.

Article pitches aren't all inherently bad ideas for articles either. A good example from my industry that I'm pretty sure was from a pitch is this one from the WSJ [1]. The basic concept of regaining focus at work is a strong one that resonates with people right now, but all the blog post ends up being is an ad for the product.

[1] http://blogs.wsj.com/atwork/2015/04/19/the-office-chair-desi...

ilaksh10y ago

My guess is that since people don't pay for subscriptions and rarely click on ads, Wired and some other publications make most of their money now from paid promotional 'journalism'. I would rather have that than nothing. Everything costs money.

stevesearer10y ago

This is more likely a case of Facebook "pitching" the story to Wired than Facebook paying Wired to run a paid promotional story. If you don't bite and do a story about the new Facebook thing, all of the other outlets will and you lose out on those potential readers. Because there are so many potential different outlets for where people can read about these bits of news, the PR people have the upper hand under the current views-based model.

tomg10y ago

Have an ad network buy products on my behalf? No thanks.

hk__210y ago

If the products match what you need or want, why would you refuse that?

tomg10y ago

The PA does not work for you, it works for FB. It has only FB's interest's in mind. You are not it's employer as you do not pay for it.

It's not in FB's interest to make honest recommendations. If Bob's Burgers is paying $1000/mo in FB ads, but Karen's Burgers keeps being recommended as the "good burger joint", how long before Bob stops buying ads? And why would Karen start buying FB ads since she's getting exposure for free?

4 more replies

viksit10y ago

Haha, it looks eerily similar to Myra, the cross platform assistant I launched last week [1]. Including the name. Interesting times.

[1] https://news.ycombinator.com/item?id=10060074.

mrwilliamchang10y ago

Myra looks similar Magic to which was launched earlier this year. Also has a name beginning with m. http://www.wired.com/2015/03/stump-magic/

viksit10y ago

True. Although, the name was chosen way earlier than Magic's launch actually! So it's a coincidence.

But it's all AI - no human assists :)

1 more reply

volaski10y ago

It sounds ridiculous to hear someone claim originality for an idea as generic AND faddish as text based assistant. If you seriously didn't know of apps like these existed before you launched (text based assistants with a human name), just google them up and you'll find tons launched since last year. I'm sure even YC has at least 3 companies that launched with this model.

BinaryIdiot10y ago

The part I find most interesting here is I'm working on something mildly similar in my spare time. Though it'll certainly be more limited than something a big company like Facebook can come up with but I'm tired to sending all my data every time I want to do something so I'm trying to squeeze this into a phone without the need for the internet to, at least, process commands. Oh and extending it will only require a little bit of JavaScript.

But I'm far away from launching it and it's only a side project. But it's cool to see so many in the space doing something I also want / wanted to do.

fizzbatter10y ago

For those of you more familiar with NLP, are there some "dumb but effective" techniques to approach https://wit.ai/ like functionality? (Libraries would be great, but i doubt there are any, for Golang)

I know NLP is difficult, and frankly i hate doing it, but i want an expressive language to "speak" to an internal process i use (a bot), and NLP seems like the only solution. I imagine a rule based approach is best (for my simple needs), but i have yet to see any examples that come close to wit.ai.

Appreciate any replies :)

viksit10y ago

See how annyang [1] does it. Forget the voice part (which it uses WebkitSpeech for). It's how they interpret commands that's probably useful in your case.

It's pretty good for a basic set and you can train more. Ultimately, you need something that is learning online and that will require an understanding of ML techniques such as CRFs.

[1] https://talater.com/annyang/

julien_c10y ago

You could have a look at AIML and something like Program AB.

dominotw10y ago

Calling this "AI" is bit of a stretch. Any software that responds in natural language is not automatically AI.

jhgg10y ago

Very interesting that the human trainers are being used to train the AI to eventually do their jobs.

rybosome10y ago

Jeremy Howard[0] gave a TED talk[1] in which he predicted that this would be a short-term trend, where labelling data for AI will be an easy way to get a job for a few years. He predicts that this will drop off as enough labelled data is provided. I think this fails to consider that our expectations of AI will increase along with our ability to manipulate increasingly large amounts of data, so we will begin labelling increasingly complex data.

[0]: https://en.wikipedia.org/wiki/Jeremy_Howard_(entrepreneur) [1]: http://www.ted.com/talks/jeremy_howard_the_wonderful_and_ter...

bla210y ago

The cycle of software life:

1. Motivated team in a larger company builds new, cool product (in this case Messenger) 2. It's good and becomes successful 3. The rest of the company wants to get in on that, think of ways to add value 4. A bunch of stuff gets bundled, some good, most bad 5. Some of the original team stay around, most get disillusioned and go work on something else 6. Eventually, the app becomes another iTunes

swalsh10y ago

I want Amazon to build a personal digital assistant, and then integrate it into filters. Today I was searching for socks, I care about 3 things, the size, the color, and whether they go up to the ankle or not. It seems like information they probably have (or a well trained net could figure out), so it would be nice if it was offered as a filter.

A few weeks ago I was trying to find toys for my son. I was most interested in "things for a 6 month old". They did have that filter, but it was 0 - 24 months. At this age a few months make a HUGE difference. I wish the box was a bit more fine grained.

Apocryphon10y ago

I wonder if Amazon leverages all of the human involvement in Mechanical Turk for any machine learning.

sib10y ago

Wait for Alexa (the "entity" behind Echo) to evolve a bit further and integrate more of what Amazon knows about you.

btbuildem10y ago

Ah, Mechanical Turk strikes again..

bm136210y ago

I worked in AWS for a bit and my favorite joke was to flippantly suggest mturk as a solution to some convoluted architecture/process.

msellout10y ago

That's not a joke, that's a legitimate solution.

1 more reply

viach10y ago

"It can purchase items, get gifts delivered to your loved ones, book restaurants, travel arrangements, appointments and way more"

So it can spend my money in behalf of me?

evincarofautumn10y ago

Of course, is that unclear? “M can actually complete tasks on your behalf. It can purchase items […]”

viach10y ago

Ahh, thank you for the explanation, I should really read more carefully! This is indeed a great application of AI related technologies.

Ohh, wait, can it do something else?

marcusgarvey10y ago

Facebook's answer to Magic?

kzhahou10y ago

Implying that Facebook built this in response to a service that had a weekend+ of buzz?

marcusgarvey10y ago

Provided that they could also see the long-term business case for it -- do you find that surprising?

apetresc10y ago

Anyone figured out how to sign up for the test? Is it a contact you can add to your Messenger list, like chatbots of old?

kirk2110y ago

Pretty cool. Written about Slack bots before and my main complaint was that I missed 'one bot to rule the all': https://medium.com/@RecurVoice/rise-of-the-slack-bots-5a7928...

zkhalique10y ago

My main question is - how did facebook make a HUGE picture show up when you share this page on facebook? Anyone know?

rwc10y ago

og:image - An image URL which should represent your object within the graph.

http://ogp.me/

zkhalique10y ago

But usually it shows up as a small image

Apparently this is 4 images and it shows up like this: https://www.dropbox.com/s/1n8wkixfimkmvlr/Screenshot%202015-...

Can anyone do this? If so, how exactly?

1 more reply

dhutchinson10y ago

I can appreciate FB trying to innovate, but with the on going privacy issues and the fact that it seems they are just repackaging existing tech, i'm just not into it.

cm201210y ago

This is basically a search engine. That is insane news for the advertising world if this is successful. Imagine FBs targeting + some intent information. I am slavering...

chimeracoder10y ago

I really love the logo. I kind of wish they'd made it a mobius strip (this one has two sides), but either way, it's awesome.

MikusR10y ago

That's Visual Studio logo.

BillTheCat10y ago

The old one is a mobius strip. The new one is more angular and 1 color. https://www.visualstudio.com/en-us/visual-studio-homepage-vs...

1 more reply

aiiane10y ago

I'm curious what the latency is like for interactions, given the human element.

andybak10y ago

The article title has the word 'Facebook' in whereas the post just mentioned 'Messenger'. Is 'Messenger' clear enough? I'm old enough to think that refers to Microsoft Messenger!

mcintyre199410y ago

Annoyingly Google have called their newest SMS thing Messenger too so it's an ambiguous term even on my phone.

mikeash10y ago

I assumed it was about Yahoo's product.

andybak10y ago

Mods have edited the title now. Good call.

umanwizard10y ago

It's M, not Q.

justinv10y ago

Exactly.

I assume OP was going for the James Bond feel, but it is M.

denzil_correa10y ago

> I assume OP was going for the James Bond feel

Well, M is also a fictional character in James Bond - head of MI6.

2 more replies

samuellavoie9010y ago

I had my hopes up, Expecting Q from star trek.

x5n110y ago

"STOP THIS RIGHT NOW, Q!" was going to be my next thought.

1 more reply

dang10y ago

We've changed the title. 'Q' for 'M' is unusual; if a typo, it was a non-Qwerty one.

ar7hur10y ago

It's M! http://www.wired.com/2015/08/how-facebook-m-works/

dang10y ago

Since that article contains more detail, we've changed the URL to it from https://www.facebook.com/Davemarcus/posts/10156070660595195. Happy to change it again if anyone can suggest a better.

andyl10y ago

What is the best alternative to Wit.ai, now that they have been consumed by Facebook??

mildbow10y ago

Why do you want an alternative?[0]

Afaik, they haven't been shut down. It's actually even free now.

[0] not that it's a bad thing, but wondering it's more than just "facebook bought it". Funny as it is, I trust that companies will go on when facebook buys them as opposed to google or amazon.

andyl10y ago

Heck - you are right! Brought up wit.ai earlier today and it rendered a blank page - thought they had been shuttered. But now I can see the full site and the service looks stronger than ever.

zkhalique10y ago

Don't you mean M?

We have been building Q ! :)

nedwin10y ago

Related: I really dig the work KitCrm.com are doing in making it easy for businesses to buy FB ads and do light marketing via SMS & messenger.

nedwin10y ago

ha! minus 4 points. Why the downvote?

j / k navigate · click thread line to collapse

169 comments

shostack10y ago

My issue with these services is they always tout these use cases like:

>"Can you make me dinner reservations?"

>"Can you help me plan my next vacation?"

Not trolling, serious question.

viksit10y ago

Agree. Those are too broad.

I'm thinking "get me a dinner reservation next sunday with patio seating for 5 in the east village at an upscale tapas place".

As I mentioned elsewhere on this page, my thesis around conversational interfaces isn't that they start off broad and use more Q/A to refine your query. That's slow, and people are visual.

unabst10y ago

Anecdotally, if we'd ask a real person "where is a good place to eat" the chance we'd go there without more information is slim. And if we don't even trust people, trusting Siri will be a while.

Is it a glitch in the Matrix or is their pitch for Cortana identical?

> What is Cortana? Cortana is your clever new personal assistant.[1]

[0] https://en.wikipedia.org/wiki/Office_Assistant [1] http://windows.microsoft.com/en-us/windows-10/getstarted-wha...

2 more replies

shostack10y ago

Thanks for helping me get to the meat of what I was trying to communicate.

1 more reply

pbreit10y ago

"get me a dinner reservation next sunday with patio seating for 5 in the east village at an upscale tapas place"

Nope. I don't think anyone would ever leave it up to M (or whatever) to select the restaurant.

2 more replies

rocky113810y ago

We also tend to use very subjective terms like "best," e.g., "where's the best place for food in Taipei?"

What is "best" and to whom? Ideally the software would figure this out but I'd always be wondering if it was just going to TripAdvisor and grabbing the first result.

tsurantino10y ago

I think the idea with a conversational interface is that it's succinct and on-demand. You receive the most relevant information directly in as simple of an interface as possible (arguably).

viksit10y ago

It's also extremely slow.

I don't think a conversational interface _replaces_ a visual one.

It's that the initial query can be complicated, and it allows you to get into that 5-6 tier deep part of your search that you would have gotten to by using 5 filters and scrolling through 50 results.

pbreit10y ago

If you look at Amazon Echo, the stuff that works is pretty specific:

Set timer for 5 minutes

Add eggs to shopping list

Play Clocks by Coldplay

Weather

1 more reply

gervase10y ago

I completely agree.

Just my 2 cents.

vidarh10y ago

jrub10y ago

I actually don't want a machine to make decisions for me. I want a machine to do what I tell it to do, or present me with I formation required to make a decision.

If I'm looking for a restaurant, show me the options, give me their distance, top reviews, and some of their dishes. If I want reservations, I'll tell it when and for how many.

shostack10y ago

100% agree. I'd refine it slightly by saying it isn't just a recommendation we want, it is presenting us with the logic under the hood in terms of HOW it made the decision--not what the decision was.

dbot10y ago

Agree completely. I saw a similar issue with sites like Operator, Magic, etc. The requests were very vague, making me wonder, "Am I spending too much time thinking about where to order a pizza from?"

And if I know which pizza shop I want to order from, what's the benefit of adding an intermediary?

vidarh10y ago

If want something new, or I'm somewhere I haven't been before, that's different - then I'll be spending time looking at the menu etc.

bobbles10y ago

I guess there could be "Hey M, order my usual from the pizza place" or something like that.

But until it elevates from 'digital assistant' level to just 'assistant' (ie. do all the work and just confirm with me before booking) it may not take off as they expect it to

Johnie10y ago

Have you ever used a concierge service either at a hotel or over the phone? It usually takes a form of a back and forth conversation to identify what you really want.

That's the difference between talking to a real person (or good NLP) and a search query.

mkopinsky10y ago

TeMPOraL10y ago

> When I'm picking something as simple as a restaurant, I typically want options, I want to read reviews, I want to consider distance, parking, attire, etc.

For me, a lot of what you're doing here is the work that should be done by a machine. Considering "distance, parking, attire, etc." is basically what we have simplex method for.

shostack10y ago

Different filters might carry different weights, and the weights might change depending on their combination, or unknown outside factors.

I guess it just seems incredibly inefficient compared to checking a couple boxes and reviewing a list along with a map or other visual aid.

corkill10y ago

"Can you make me dinner reservations?" would lead to a response like "Any preferences on the type of food and location?"

Over time they learn your preferences so they don't need to ask location for example next time.

Then they send you the best options they found (and the benefits of each one and price range) then you reply back option 1 and they book it.

choppaface10y ago

deepGem10y ago

Bassed on my understanding of semantics and Knowledge engineering, this is doable.

murbard210y ago

mbesto10y ago

The article on HN changed. The original one from Facebook directly made way more sense and didn't use such generic suggestions.

rohunati10y ago

no, i think you're spot on. it's always interplay between reading reviews, while factoring in cuisine, distance and price.

roymurdock10y ago

cylon1310y ago

Relevant skit: https://www.youtube.com/watch?v=d4WrPkKc2Wg

mbesto10y ago

The first thing I see when someone asks "find me a good burger place in Chicago" is "how can companies game this through official ($) or artificial (spam) means?"

bentcorner10y ago

mikeash10y ago

Once this service is released to the public, you should definitely ask it that second question and see how it responds.

TeMPOraL10y ago

Yup. That's why believe that any "personal assistant" technology run by a commercial third party will be shit - it'll be used to try and sell you stuff, not recommend actually good options.

vidarh10y ago

ErikAugust10y ago

I think the best way to go around this is use Tinder to get matches with locals, then ask them through chat.

pen2l10y ago

The first thing I thought after seeing that example is FB is the new Yelp. Restaurants will have to pay FB for visibility and recognition...

Same thing as Yelp, just a lot scarier.

jameshart10y ago

knn10y ago

3 more replies

msvan10y ago

acchow10y ago

> Facebook already has two top-50 apps

I count 4: Facebook, Messenger, Whatsapp, Instagram.

robinson7d10y ago

Roodgorf10y ago

5 more replies

differentView10y ago

Maybe after seeing their smart phone flopped, they're going with an inside-out approach.

gearoidoc10y ago

Good point. Unsure why you've been downvoted for this.

teaneedz10y ago

The privacy implications would keep me from recommending this to others or even trying it out myself.

golergka10y ago

> promoting them to that point is expensive

visarga10y ago

ot10y ago

chillacy10y ago

That's a semantic argument more than anything. A small furry mammal with four legs, a long tail, whiskers, and pointy ears is what we'd call a cat, no matter what word you assign to it.

2 more replies

bhouston10y ago

It may have used tags on YouTube videos to identify which had cats. Not sure if that counts as completely unsupervised.

visarga10y ago

Nope. Here's the link to the paper Google team published in 2002.

Building high-level features using large scale unsupervised learning http://arxiv.org/abs/1112.6209

Wasn't it amazing that they could distill the concept of cat from images with no help from external labels (human intervention)? They missed the core of the discovery by not understanding that.

viksit10y ago

To use an analogy - if messaging apps are the new "browsers", then content accessed through them are the new "websites". What FB is doing is the equivalent of AOL in the 90s.

What then, is the equivalent of a search engine like Google/Yahoo, in that world?

doublerebel10y ago

What if you could get a recommendation from your friend's friend without asking them, and without violating trust or privacy? This is what I am building today.

wallacrw10y ago

The search engine would have to be the NLP and information retrieval that's turning the requests into actions or answers.

Which is why I'll now plug the company I work for, MindMeld, since (i) we do that better than Wit and (ii) we are not feeding our data to an advertising team.

lbotos10y ago

For a period, a few people thought that twitter had enough sway to be that new "messaging search" engine. I've used it in such a way when I wanted to find hyper localized information.

mintplant10y ago

“The AI tries to do everything,” says Alex Lebrun, the founder of Wit.ai, a startup Facebook acquired to help build this smartphone tool. “But the AI is supervised by the people.”

Congrats to ar7hur! Here's the original Show HN introducing Wit.ai: https://news.ycombinator.com/item?id=6373645

rebootthesystem10y ago

User: "Hello M."

M: "How may I help you?"

M: "My responses are limited. Would you like me to find you a restaurant?"

M: "Ah, but there's a great new BBQ joint not too far from you"

User: "I'm vegetarian"

M: "My responses are limited. How would you like a thrilling and exciting hunting safari in Africa?"

User: ":-("

golergka10y ago

But if it could answer this question well, it would mean that you would be out of a job pretty soon and wouldn't be casually dining in restaurants on your unemployment check anyway.

JetSpiegel10y ago

M: Bing Edition

__michaelg10y ago

It looks like you're writing a letter. Would you like help?

pdeuchler10y ago

Is it just me or does this article read as a thinly veiled sales pitch to anyone else?

frostmatthew10y ago

http://www.paulgraham.com/submarine.html

stevesearer10y ago

Bingo.

[1] http://blogs.wsj.com/atwork/2015/04/19/the-office-chair-desi...

ilaksh10y ago

stevesearer10y ago

tomg10y ago

Have an ad network buy products on my behalf? No thanks.

hk__210y ago

If the products match what you need or want, why would you refuse that?

tomg10y ago

The PA does not work for you, it works for FB. It has only FB's interest's in mind. You are not it's employer as you do not pay for it.

4 more replies

viksit10y ago

Haha, it looks eerily similar to Myra, the cross platform assistant I launched last week [1]. Including the name. Interesting times.

[1] https://news.ycombinator.com/item?id=10060074.

mrwilliamchang10y ago

Myra looks similar Magic to which was launched earlier this year. Also has a name beginning with m. http://www.wired.com/2015/03/stump-magic/

viksit10y ago

True. Although, the name was chosen way earlier than Magic's launch actually! So it's a coincidence.

But it's all AI - no human assists :)

1 more reply

volaski10y ago

BinaryIdiot10y ago

But I'm far away from launching it and it's only a side project. But it's cool to see so many in the space doing something I also want / wanted to do.

fizzbatter10y ago

Appreciate any replies :)

viksit10y ago

See how annyang [1] does it. Forget the voice part (which it uses WebkitSpeech for). It's how they interpret commands that's probably useful in your case.

It's pretty good for a basic set and you can train more. Ultimately, you need something that is learning online and that will require an understanding of ML techniques such as CRFs.

[1] https://talater.com/annyang/

julien_c10y ago

You could have a look at AIML and something like Program AB.

dominotw10y ago

Calling this "AI" is bit of a stretch. Any software that responds in natural language is not automatically AI.

jhgg10y ago

Very interesting that the human trainers are being used to train the AI to eventually do their jobs.

rybosome10y ago

[0]: https://en.wikipedia.org/wiki/Jeremy_Howard_(entrepreneur) [1]: http://www.ted.com/talks/jeremy_howard_the_wonderful_and_ter...

bla210y ago

The cycle of software life:

swalsh10y ago

Apocryphon10y ago

I wonder if Amazon leverages all of the human involvement in Mechanical Turk for any machine learning.

sib10y ago

Wait for Alexa (the "entity" behind Echo) to evolve a bit further and integrate more of what Amazon knows about you.

btbuildem10y ago

Ah, Mechanical Turk strikes again..

bm136210y ago

I worked in AWS for a bit and my favorite joke was to flippantly suggest mturk as a solution to some convoluted architecture/process.

msellout10y ago

That's not a joke, that's a legitimate solution.

1 more reply

viach10y ago

"It can purchase items, get gifts delivered to your loved ones, book restaurants, travel arrangements, appointments and way more"

So it can spend my money in behalf of me?

evincarofautumn10y ago

Of course, is that unclear? “M can actually complete tasks on your behalf. It can purchase items […]”

viach10y ago

Ahh, thank you for the explanation, I should really read more carefully! This is indeed a great application of AI related technologies.

Ohh, wait, can it do something else?

marcusgarvey10y ago

Facebook's answer to Magic?

kzhahou10y ago

Implying that Facebook built this in response to a service that had a weekend+ of buzz?

marcusgarvey10y ago

Provided that they could also see the long-term business case for it -- do you find that surprising?

apetresc10y ago

Anyone figured out how to sign up for the test? Is it a contact you can add to your Messenger list, like chatbots of old?

kirk2110y ago

Pretty cool. Written about Slack bots before and my main complaint was that I missed 'one bot to rule the all': https://medium.com/@RecurVoice/rise-of-the-slack-bots-5a7928...

zkhalique10y ago

My main question is - how did facebook make a HUGE picture show up when you share this page on facebook? Anyone know?

rwc10y ago

og:image - An image URL which should represent your object within the graph.

http://ogp.me/

zkhalique10y ago

But usually it shows up as a small image

Apparently this is 4 images and it shows up like this: https://www.dropbox.com/s/1n8wkixfimkmvlr/Screenshot%202015-...

Can anyone do this? If so, how exactly?

1 more reply

dhutchinson10y ago

I can appreciate FB trying to innovate, but with the on going privacy issues and the fact that it seems they are just repackaging existing tech, i'm just not into it.

cm201210y ago

This is basically a search engine. That is insane news for the advertising world if this is successful. Imagine FBs targeting + some intent information. I am slavering...

chimeracoder10y ago

I really love the logo. I kind of wish they'd made it a mobius strip (this one has two sides), but either way, it's awesome.

MikusR10y ago

That's Visual Studio logo.

BillTheCat10y ago

The old one is a mobius strip. The new one is more angular and 1 color. https://www.visualstudio.com/en-us/visual-studio-homepage-vs...

1 more reply

aiiane10y ago

I'm curious what the latency is like for interactions, given the human element.

andybak10y ago

The article title has the word 'Facebook' in whereas the post just mentioned 'Messenger'. Is 'Messenger' clear enough? I'm old enough to think that refers to Microsoft Messenger!

mcintyre199410y ago

Annoyingly Google have called their newest SMS thing Messenger too so it's an ambiguous term even on my phone.

mikeash10y ago

I assumed it was about Yahoo's product.

andybak10y ago

Mods have edited the title now. Good call.

umanwizard10y ago

It's M, not Q.

justinv10y ago

Exactly.

I assume OP was going for the James Bond feel, but it is M.

denzil_correa10y ago

> I assume OP was going for the James Bond feel

Well, M is also a fictional character in James Bond - head of MI6.

2 more replies

samuellavoie9010y ago

I had my hopes up, Expecting Q from star trek.

x5n110y ago

"STOP THIS RIGHT NOW, Q!" was going to be my next thought.

1 more reply

dang10y ago

We've changed the title. 'Q' for 'M' is unusual; if a typo, it was a non-Qwerty one.

ar7hur10y ago

It's M! http://www.wired.com/2015/08/how-facebook-m-works/

dang10y ago

Since that article contains more detail, we've changed the URL to it from https://www.facebook.com/Davemarcus/posts/10156070660595195. Happy to change it again if anyone can suggest a better.

andyl10y ago

What is the best alternative to Wit.ai, now that they have been consumed by Facebook??

mildbow10y ago

Why do you want an alternative?[0]

Afaik, they haven't been shut down. It's actually even free now.

[0] not that it's a bad thing, but wondering it's more than just "facebook bought it". Funny as it is, I trust that companies will go on when facebook buys them as opposed to google or amazon.

andyl10y ago

Heck - you are right! Brought up wit.ai earlier today and it rendered a blank page - thought they had been shuttered. But now I can see the full site and the service looks stronger than ever.

zkhalique10y ago

Don't you mean M?

We have been building Q ! :)

nedwin10y ago

Related: I really dig the work KitCrm.com are doing in making it easy for businesses to buy FB ads and do light marketing via SMS & messenger.

nedwin10y ago

ha! minus 4 points. Why the downvote?

j / k navigate · click thread line to collapse