I think if your attitude is that it's an oracle, then you already have the wrong attitude for using the tool. ChatGPT is a tool; if you don't know how to use the tool, stop complaining that you don't like it. Imagine telling everyone scalpels are horrible tools because they tried to perform surgery on someone and botched it up.
One day, not too far off, we are going to be able to tell a 'chat bot': make me a cartoon, set in a steampunk fantasy. Make it 2 seasons with 22 episodes each season. Great, add a cliffhanger at the end of episode 11. Add a love-story component to it. Reduce it back down to 1 season but 44-minute episodes.
Content creation is no longer going to be tied to knowing how to draw cartoons or having armies of writers. Yes, we can get garbage out of the system, but it's a tool, and plenty of tools produce garbage results if you don't know how to use them.
As an experiment, I asked ChatGPT to write a business plan (for a business my brother started). The plan was very close to what my brother produced after working on it for a month. That's powerful; that's worthy of being 'the future'.
Broad strokes are broad strokes. I can procedurally generate levels all day long in a video game, but for me, they're never going to be as compelling or interesting as a lower-resolution and low quality textured game from the '90s or '00s where every single tree and rock is placed with intent.
I already think modern cartoons are fairly sterile and soulless versus their hand drawn or hybrid counterparts. It isn't even elitism, they just don't hold my attention or interest me artistically, stylistically, or in terms of content.
If you choose to express yourself in broad strokes, that's fine, whatever floats your boat. I'll continue to chase things that have intent and artistry behind every aspect of them. Generic and formulaic is generic and formulaic all day long. It's also why I don't like most modern anime, it's sterile visually and isn't why I enjoyed the medium.
For a writer, if you write out a plot, you can get the AI to actually simulate the characters' responses and dialogue (even voiced by AI!). There, you've LITERALLY brought a character to life. Each character is driven by a different persona simulated by a different AI, and the quality of stories that will create will annihilate what came before.
You want to write an adventure, but want to keep it unpredictable? Ask the AI for ideas; there, the adventure is now a true adventure, not a fake mirage created by the writer.
No need to describe scenery, no need to describe character appearances. Feed those descriptions into txt2img, and you get portraits that would have cost $1000/pic from top-tier artists.
Genericness and formulaicness come from having TOO MANY PEOPLE. Too many people involved in production means the creator must dilute intent, appeal to wider audiences, and limit risks to ensure costs are recouped. Once AI gets going, you'll see indie creators making full anime series and releasing them on YouTube, because for an individual creator, even ad + Patreon revenue alone would sustain a comfortable existence, with no dependency on corporations or teams.
I thought people who love art would be exhilarated by AI. I realized the majority of artists don't love art. They love drawing, but not art. They love socializing with artists, but not art. They love receiving attention and income from their art, but not art. That's all fair and fine. But there will be people who just want to create the best possible art, no matter the method, no matter the reward, and with AI, this latter group will outcompete the first, hard.
The popularity of Minecraft and other procedural games implies that a large number of people still value exploring unknown generated content, even if it isn't curated.
Yes, the quality won't be as good, but you do get quantity instead. And the quality will improve.
What makes them great is that people enjoy them. Whether they were created with "Intent, attention to detail and self-expression" or monkeys banging on typewriters is irrelevant and indistinguishable.
For someone with that POV, you sure are peddling the "Everything is soulless, stop enjoying things" perspective.
There are various fine-tuned models out there for conversation or storytelling. They're quite small in terms of parameter count at the moment, but I don't see it as fundamentally impossible.
How much knowledge and ideas did your brother personally develop over this month in addition to the business plan? Being handed a working plan is sometimes less useful than the aggregate experience leading up to the plan.
The "why?" is frequently more important than the "what?" in business. An LLM doesn't really have an understanding of how the world works, so I would be very sceptical of its ability to write a sensible business plan. It doesn't even understand the product/service and its nuances, because it can't experience it.
The LLM is a yes-man with no experiences.
Imagine taking a years-old book about what worked for someone's business back then, then naively assuming it'll work for yours today without considering how the world, market, people, technology et al. may have changed in that time. This is the same thing. It assumes x is x. Usually x is not x. People used to compare their business to Apple, despite not even being in remotely the same industry.
It's a tool that can be opaquely configured to be used in a million different ways, and when using it does not bring about the desired result, its acolytes sneer and suggest that you're using it wrong.
It's like a multi-tool that only works when you're blindfolded. Sure, it can be used to hammer nails and tighten screws and strip wires and measure a tire's pressure, but it makes it quite difficult to find the magical incantation that will apply the right end to the job.
(And most of the time, it quietly leaves the screw untightened, the wire clipped, and the tire with a hole in it. It's the user who's wrong, of course.)
When you use a hammer or a drill, do you expect it to sometimes not hit the nail or drive the screw?
If ChatGPT is a tool for knowledge transfer/extraction, it can't hallucinate/lie to you/be wrong/make stuff up.
If it's a tool for potentially discovering some knowledge that may be true and needs to almost always be verified by either a compiler or a followup "find me a reference/discussion" Google search to make sure it's accurate, then sure. But I don't think that's what it's primarily being advertised as.
Web searches will for sure give you wrong answers. Even professors or other experts in a field will be wrong sometimes. Heck, even Einstein got some things wrong.
Your goalpost is in the wrong spot. Tools don’t need to be and probably never can be perfect. But that doesn’t mean they’re not useful.
Not sure about drills, but this absolutely happens with drivers if you fumble mating the bit to the screw head, or if you miss the stud, or if you overtighten, or if you don't sometimes pre-drill, or if you strip the head, or if you don't correctly gauge underlying material composition, or thickness, or if you...
This is such a weird statement from someone in the tech space. Programming languages rarely have an opinion about how they are to be used (for example, that JS MUST only run in the browser, or which code style to use).
When I chat with customer support, I wish they could meet me where I am instead of my needing to learn their tools. For example, I want to say "cancel my subscription" and have my subscription get cancelled. I don't want to have to figure out which sub-menu of the sub-menu has the magic "end subscription" button.
I know how to use my tool (English). LLMs teach computers how to use that tool too.
I also get support emails for password resets, which I try to make as simple as possible.
People don't want to learn new tools if their existing tools (language) work just fine.
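The "say what you want and have it happen" idea above can be sketched as an intent router. A real system would use an LLM to interpret the request; this toy version stands in with exact phrase matching, and the intent names and actions here are made up for illustration:

```python
# Toy intent router: map a free-text support request to an action.
# A production system would use an LLM classifier; simple phrase
# matching stands in here. All intent/action names are hypothetical.
INTENTS = {
    "cancel my subscription": "cancel_subscription",
    "reset my password": "send_password_reset",
}

def route(message: str) -> str:
    """Return the action for a message, or escalate when nothing matches."""
    text = message.lower()
    for phrase, action in INTENTS.items():
        if phrase in text:
            return action
    return "escalate_to_human"

print(route("Please cancel my subscription"))  # cancel_subscription
```

The point is that the user's existing tool (language) is the interface; the mapping to buttons and sub-menus happens on the service's side.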
Erm, they absolutely have an opinion. That's why I can't just write however I like in whatever language. I need to stick to the designer's opinions on syntax and semantics otherwise it won't work.
The only difference will be that script-following customer service reps giving you the runaround will be replaced by indefatigable chatbots giving you the runaround, which honestly sounds pretty hellish to me.
really? I feel like Apple's App Store provides great UX with their warning emails and centralized subscription management view. It is well documented too: https://support.apple.com/en-us/HT202039
But I still get emails asking me to cancel subscriptions.
> Compare that to looking at a typical chat interface. The only clue we receive is that we should type characters into the textbox. The interface looks the same as a Google search box, a login form, and a credit card field.
And Google is one of the most used tools. Probably more used than pen and paper today.
The point-and-click adventure games from Sierra and LucasArts were a huge step forward in interaction, although you didn't have to use your imagination as much to solve the puzzles.
And here we are again asking users to type their way to success.
Provided the responses aren't too brittle (and the LLM getting it wrong isn't too upsetting or found out too late), lots of non-power-users are going to prefer it, at least in cases where a menu or form input with about six options won't suffice.
Toolbars and menus provide affordances, but you still need to know what things are called and what order to use them in. "I'd like to email this file as a PDF and I'd also like to print it." may be much easier in a chat UX than in a menu-based UX. Often these things can co-exist, but chat UX has access to much more nuanced UI that would otherwise be too complex to build or expose.
I’m distressed by the growing use of chatbots in online payments. You already know the ontology: get balance information, make a payment, contact customer service.
I much prefer not to speak aloud to a robot on the phone, especially in the office, when there should only be three options.
Conversations with a robot speaking in a mixed tone of obsequiousness and superciliousness make me bananas.
It has all the affordances. You can turn it into anything you want. Want it to respond with JSON? Check, it can do that. Turn a wall of text into Python data structures? Check, it can do that too.
You can take your LLM text interface and put whatever API you want on top. You start with clay and mold it into anything you need. You construct a parser so that the incoming data is already constrained and validated. Same goes for the output.
I am not arguing that the chat interface is everything to everyone from a design perspective. I do think that as UI workbenches improve, the need for a dedicated application will nearly disappear, though. Applications will become a 1-5 page specification, including the UI.
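The constrain-and-validate step described above can be sketched concretely. Assuming an LLM was asked to answer in JSON (the reply, field names, and allowed actions below are all hypothetical), the parser rejects anything that doesn't fit before the application acts on it:

```python
import json

# Hypothetical raw reply from an LLM that was asked to answer in JSON.
llm_reply = '{"action": "set_reminder", "time": "2024-01-05T09:00", "note": "standup"}'

REQUIRED_FIELDS = {"action", "time", "note"}
ALLOWED_ACTIONS = {"set_reminder", "cancel_reminder"}

def parse_llm_reply(raw: str) -> dict:
    """Constrain and validate the model's output before acting on it."""
    data = json.loads(raw)  # rejects anything that isn't valid JSON
    missing = REQUIRED_FIELDS - data.keys()
    if missing:
        raise ValueError(f"missing fields: {missing}")
    if data["action"] not in ALLOWED_ACTIONS:
        raise ValueError(f"unknown action: {data['action']}")
    return data

print(parse_llm_reply(llm_reply)["action"])  # set_reminder
```

Free-form model output becomes usable program input only after this kind of gate; the same gating applies on the way back out.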
>It has all the affordances.
these are two different ways of saying the same thing
A graphical UI can provide much more and much more intuitive guidance than a chat input ever will. And I say that as a big fan of Unix and the shell.
I could tell a chat bot I am finding the horizontal split in my editor is annoying because I have a wide monitor, and have it tell me there's a setting for that and ask if I want the default changed.
With a GUI I might have to go through the File menu for settings, check if it's in Edit > Preferences, check Tools > Options, before maybe having to find out online that it's in some settings file.
"There's an ongoing trend pushing towards continuous consumption of shorter, mind-melting content. Have a few minutes? Stare at people putting on makeup on TikTok. Winding down for sleep? A perfect time to doomscroll 180-character hot takes on Twitter."
*280 chars, he means.
*4000 chars, he means.
*10000 chars, he means.
where's the fine-tuned prompt helper model?
I'm guessing that training custom language models on a company's data must be one of the hottest things to be working on right now if you're looking for VC money. (If there's something out there that compares to how well Stable Diffusion + DreamBooth works for images, I'd be thankful for any pointers.)
Web search changed that. Most queries work, at least somewhat.
There's a point where freeform text input becomes better than structured input. A simple search box is what people mostly use instead of an advanced search form, let alone a web directory (like Yahoo! back in the day).
For web search, there are very few error messages. If you enter a query that doesn't work very well, you get back results that aren't very good or what you wanted, so you try something else.
With AI chatbots, expectations are sky-high, but there are times when they should refuse with a good error message, because they really can't do what you're hoping to do. An example is when you ask it to explain its reasoning. An LLM never knows why it wrote what it did, but it will try to invent a plausible explanation anyway. [1]
Better error messages that help users understand what chatbots can actually do would help avoid misconceptions, but this won't happen unless the error messages are trained in.
[1] https://skybrian.substack.com/p/ai-chatbots-dont-know-why-th...
Classic example, Siri. It's so easy to quickly find stuff you feel like it should be able to do, but it just can't. "When was my last message from Steve" etc.
It should be pretty easy (even with today’s APIs and technology) to have an LLM design a user interface for you for your current task.
Simplest way: output a JSON of simple control definitions with every answer.
Coolest way: Just have it generate a full-ass React front end or whatever on every message.
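The "JSON of simple control definitions" idea can be sketched like this. The control schema below is invented for illustration: the model would attach a list of widget definitions to its answer, and the client renders them (here, as plain text stand-ins for real UI widgets):

```python
import json

# Hypothetical control definitions an LLM might attach to an answer.
ui_spec = json.loads("""
[
  {"type": "slider", "label": "Brightness", "min": 0, "max": 100},
  {"type": "button", "label": "Apply"}
]
""")

def render_controls(spec: list) -> list:
    """Turn each control definition into a plain-text widget description."""
    rendered = []
    for control in spec:
        if control["type"] == "slider":
            rendered.append(f"[{control['label']}: {control['min']}----{control['max']}]")
        elif control["type"] == "button":
            rendered.append(f"({control['label']})")
    return rendered

for line in render_controls(ui_spec):
    print(line)
```

The same spec could just as easily be mapped to real React components; the key design choice is that the model emits a constrained schema rather than arbitrary markup.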
Down the line people will expect applications to be chat ready. They will see an input box and expect the application to understand natural language and respond in the most helpful way. Which might be showing an error message or suggesting relevant next steps.
https://www.latent.space/p/build-ai-ux
full recording in the video at the bottom!
Quite a few people use search by awkwardly typing one or two words on mobile, probably misspelled and/or auto-completed as they type. The query isn't sophisticated and lacks a lot of context and parameters, which the search engine then tries to guess.
When you use ChatGPT that way, you'll get useless generic answers. It seems to shine specifically when the input is more specific and detailed, which also presumes users are willing and able (education level) to give such rich input.
The idea that it's better than search for this specific normie behavior, I openly question. And let's not forget the economics: more expensive to run, vastly less ad space, and content owners (the whole web) are going to be pissed and will put up ever-higher walls.
Put differently: if Google’s search models already have the ability to return great results for poor queries, why couldn’t a large language model (or a plug-in for one) learn the same algorithm?
Sidenote, i've found GPT useful enough to pay for (GPT Plus) by doing the opposite. Or rather, i find it very useful when i struggle to search for problems. ChatGPT helps guide me to new search or research terms, sometimes even providing the answer more directly.
It feels like the olden days where Google was great at finding a movie based on some vague movie description. GPT does that for a ton of things for me, enough that i found it useful.
It hasn't replaced online research but it has accelerated it for me.
So the one-track thinking of garbage-in, garbage-out is not the limitation any more.
What we're now able to do is precisely garbage in, less garbage out.
You can take a vague prompt, ask GPT to hypothesize what it means and why the user is asking the question, and then generate a detailed prompt. Then use that prompt to perform the search.
Trying this out in the playground, I see a surprisingly capable search experience.
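The expansion step described above can be sketched as a meta-prompt. The template wording and the sample query below are invented for illustration; the actual call to a model (which would return the detailed prompt that drives the real search) is left out:

```python
# Hypothetical two-step expansion: the vague user query is wrapped in a
# meta-prompt; a model (call stubbed out here) would return a detailed
# prompt that then drives the real search.
EXPANSION_TEMPLATE = (
    "A user typed the vague query: {query!r}.\n"
    "1. Hypothesize what the user most likely means and why they are asking.\n"
    "2. Rewrite it as one detailed, self-contained search prompt."
)

def build_expansion_prompt(query: str) -> str:
    """Wrap a raw user query in the expansion meta-prompt."""
    return EXPANSION_TEMPLATE.format(query=query)

print(build_expansion_prompt("best laptop?"))
```

The vague query goes in once as data, and the model does the work of guessing context and parameters that the user left out.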
I use GPT-4 for debugging a lot now, because it's excellent at taking nothing other than an error message from the console and giving me back what's wrong and how to fix it. It's not perfect, but it's good enough that I reach for it by default now. I don't have API access to GPT-4 yet, so I compared how well GPT-3.5 performed at the same task; for the example I tried, it just didn't get close enough to be truly useful, so I wouldn't rely on it in my daily workflow the way I do with GPT-4.
But what I am actually quite interested in, and what I'm seeing a lot of, is exactly how far you can push a less capable model through prompt engineering. I think it's surprisingly further than you might initially expect.
I just typed "eli5 hn vs redit" (misspelled reddit), and it understands perfectly.
Books were very simple machines to dress up our thoughts in, and the community reoriented itself well to the demands of the machine. But progress marches on the graves of obsolete machines, as it did over quills, book presses, typewriters, telegrams, libraries, and word processors. Joe Reader has the same access to mind-blowing dynamic text that creatives only wish to see as a finishing tool. He won't settle for the old glass window over static text just to please the artisan book writer.
1) I don't think chatting with anything, human or machine, is a learning experience, particularly since the machine's veracity is poor and untrustworthy; Hinton's resignation today tells you everything you need to know about the narrative inside big research orgs right now.
2) Recognition vs. recall. It's the equivalent of an informal-language CLI (which I prefer, by the way), so there is no recognition (as in symbols), only recall.
Long story short, I think the emergent need is for written communication, with a tip of the hat to Daniele Procida:
https://ubuntu.com/blog/engineering-transformation-through-d...
Except that what's missing is a human-computer collaboration, i.e. sensemaking with another tip of the hat to Peter Pirolli:
https://www.efsa.europa.eu/sites/default/files/event/180918-...
Driving these suggestions is all of your data as well as your goals and values that you can give to the assistant in natural language.
At work the goal might be: "I want to sell $100,000 worth of widgets this quarter" and it will break down step by step how that might be possible.
For personal life it might be "I want to get involved in the kayaking community" and it will recommend activities, clubs, etc.
Once these assistants are good enough, it will be reckless to not use one (especially at work). We will then live in a world where AI and human live together and make decisions together hand in hand. Buckle up.
ChatGPT can be confidently stupid, but what if it gets better?
You need explanations/affordances of what it can and can't do only when its capabilities are limited. If it really could do whatever you asked, you wouldn't need it. Just say what you want.
Some neural connection to the brain that interprets your thoughts is the only logical thing that will supersede chat AI, but that won't happen for a while. Maybe a connection to all your data will happen first, so the AI will better understand what type of person you are and what you want; that's probably already happening based on past responses.
I was already convinced of this. What I'm not convinced of, and the article has little to say about, is
> Chatbots Are Not the Future... chatbots are not the future of interfaces.
Chatbots are a terrible interface to LLMs, and yet they are absolutely going to be the future of every third godawful website I must visit.
Indeed, just like any other text box: like WhatsApp, like Word, all tools that no one uses because they lack affordances.
He called ChatGPT an oracle; nice, but not enough.
I want someone to name their chatbot `oracle of delphi` plz. Thank you.
"Please don't complain about tangential annoyances—e.g. article or website formats, name collisions, or back-button breakage. They're too common to be interesting."