undefined | Better HN

0 pointsplaidfuji2y ago0 comments

This is a very cool demo - if you dig deeper there’s a clip of them having a “blind” AI talk to another AI with live camera input to ask it to explain what it’s seeing. Then they, together, sing a song about what they’re looking at, alternating each line, and rhyming with one another. Given all of the isolated capabilities of AI, this isn’t particularly surprising, but seeing it all work together in real time is pretty incredible.

But it’s not scary. It’s… marvelous, cringey, uncomfortable, awe-inspiring. What’s scary is not what AI can currently do, but what we expect from it. Can it do math yet? Can it play chess? Can it write entire apps from scratch? Can it just do my entire job for me?

We’re moving toward a world where every job will be modeled, and you’ll either be an AI owner, a model architect, an agent/hardware engineer, a technician, or just.. training data.

0 comments

anon3738392y ago

> We’re moving toward a world where every job will be modeled

After an OpenAI launch, I think it's important to take one's feelings about the future impact of the technology with a HUGE grain of salt. OpenAI are masters of hype. They have been generating hype for years now, yet the real-world impacts remain modest so far.

Do you remember when they teased GPT-2 as "too dangerous" for public access? I do. Yet we now have Llama 3 in the wild, which even at the smaller 8B size is about as powerful as the [edit: 6/13/23] GPT-4 release.

As someone pointed out elsewhere in the comments, a logistic curve looks exponential in the beginning, before it approaches saturation. Yet, logistic curves are more common, especially in ML. I think it's interesting that GPT-4o doesn't show much of an improvement in "reasoning" strength.

jdietrich2y ago

A Google search for practically any long-tail keywords will reveal that LLMs have already had a very significant impact. DuckDuckGo has suffered even more. Social media is absolutely lousy with AI-powered fraud of varying degrees of sophistication.

It's glib to dismiss safety concerns because we haven't all turned into paperclips yet. LLMs and image gen models are having real effects now.

We're already at a point where AI can generate text and images that will fool a lot of people a lot of the time. For every college-educated young person smugly pointing out that they aren't fooled by an image with six-fingered hands, there are far more people who had marginal media literacy to begin with and are now almost defenceless against a tidal wave of hyper-scaleable deception.

We're already at a point where we're counselling elders to ignore late-night messages from people claiming to be a relative in need of an urgent wire transfer. What defences do we have when an LLM will be able to have a completely fluent, natural-sounding conversation in someone else's voice? I'm not confident that I'd be able to distinguish GPT-4o from a human speaker in the best of circumstances and I'm almost certain that I could be fooled if I'm hurried, distracted, sleep deprived or otherwise impaired.

Regardless of any future impacts on the labour market or any hypothesised X-risks, I think we should be very worried about the immediate risks to trust and social cohesion. An awful lot of people are turning into paranoid weirdos at the moment and I don't particularly blame them, but I can see things getting seriously ugly if we can't abate that trend.

autoexec2y ago

> I'm not confident that I'd be able to distinguish GPT-4o from a human speaker in the best of circumstances and I'm almost certain that I could be fooled if I'm hurried, distracted, sleep deprived or otherwise impaired.

Set a memorable verification phrase with your friends and loved ones. That way if you call them out of the blue or from some strange number (and they actually pick up for some reason) and you tell them you need $300 to get you out of trouble they can ask you to say the phrase and they'll know it's you if you respond appropriately.

I've already done that and I'm far less worried about AI fooling me or my family in a scam than I am about corporations and governments using it without caring about the impact of the inevitable mistakes and hallucinations. AI is already being used by judges to decide how long people should go to jail. Parole boards are using it to decide who to keep locked up. Governments are using it to decide which people/buildings to bomb. Insurance companies are using to deny critical health coverage to people. Police are using it to decide who to target and even to write their reports for them.

More and more people are going to get badly screwed over, lose their freedom, or lose their lives because of AI. It'll save time/money for people with more money and power than you or I will ever have though, so there's no fighting it.

verticalscaler2y ago

The way to get around your side channel verification phrase is by introducing an element of stress and urgency: "omg, help, I'm being robbed and they need $300 immediately or they'll hurt me, no time for a passphrase!" can additionally feign memory loss.

Alternatively while it may be difficult to trick you directly, phishing the passphrase from a more naive loved one or bored coworker and then parroting it back to you is also a possibility. 'etc.

Phone scams are no joke and this is getting past the point where regular people can be expected to easily filter them out.

withinboredom2y ago

Or just ask them to tell them something only you both know (a story from childhood, etc). Reminds me of a book where this sort of thing was common (don't remember the title):

1. something you have

2. something you know

3. something you are

These three things are required for any authz.

2 more replies

cbm-vic-202y ago

"Hey Janelle, what's wrong with Wolfie?"

2 more replies

hirako20002y ago

People are and have always been screwed over by modestly equiped humans.

1 more reply

s3p2y ago

"Hey mom and dad, we need a memorable phrase so AI bots can't call us and pretend to be each other."

fauigerzigerk2y ago

I think humankind has managed massive shifts in what and who you could trust several times before.

We went from living in villages where everyone knew each other to living in big cities where almost everyone is a stranger.

We went from photos being relatively reliable evidence to digital photography where anyone can fake almost anything and even the line between faking and improving is blurred.

We went from mass distribution of media being a massive capital expenditure that only big publishers could afford to something that is free and anonymous for everyone.

We went from a tiny number of people in close proximity being able to initiate a conversation with us to being reachable for everyone who could dial a phone number or send an email message.

Each of these transitions caused big problems. None of these problems have ever been completely solved. But each time we found mitigations that limit the impact of any misuse.

I see the current AI wave as yet another step away from trusting superficial appearances to a world that requires more formal authentication protocols.

Passports were introduced long ago but never properly transitioned into the digital world. Using some unsigned PDF allegedly representing a utility bill as proof of address seems questionable as well. And the way in which social security numbers are used for authentication in the US is nothing short of bizarre.

So I think there are some very low hanging fruits in terms of authentication and digital signatures. We have all the tools to deal with the trust issues caused by generative AI. We just have to use them.

ant6n2y ago

During these boundaries people can die. Consider the advent of yellow journalism and the connection with the Spanish-American war 1898: https://en.m.wikipedia.org/wiki/American_propaganda_of_the_S...

1 more reply

b1122y ago

Outside of the transition to a large city, virtually everything you've mentioned happened in the last 1/2 century. Even the phone was expensive, and not widely in use in under 100 years ago.

That's massive fast change, and we haven't culturally caught up to any of it yet.

2 more replies

insane_dreamer2y ago

Just because we haven't yet destroyed the human race through the use of nuclear weapons doesn't mean that it can't or won't happen now that we have the capacity to do so. And I would add that we developed that capacity in less than 50 years of creating the first atomic bomb. We're now living on a knife's edge and at the merge of safeguards which we don't give much thought to on a daily basis because we hope that they won't fail.

That's how I look at where we're going with AI. Plunge along into the new arms race first and build the capacity, then later figure out the treaties and safeguards which we hope will keep our society safe (and by that I don't mean a Skynet-like AI-powered destruction, but the upheaval of our society potentially as impactful as the industrial revolution.)

Humanity will get through it, I'm sure. But I'm not confident it will be without a lot of pain and suffering for a large percentage of people. We also managed to survive 2 world wars in the last century--but it cost the lives of 100 million people.

indigochill2y ago

I tend to think the answer is to go back to villages, albeit digital ones. Authentication only enforces that an account is accessed by the correct "user", but particularly in social media many users are bad actors of various stripes. The strongest account authentication in the world doesn't help with that.

So the question, I think, is how do we reclaim trust in a world where every kind of content can be convincingly faked? And I think the answer is by rebuilding trust between users such that we actually have reason to simply trust the users we're interacting with aren't lying to us (and that also goes for building trust in the platforms we use). In my mind, that means a shift to small federated and P2P communication since both of these enable both the users and the operators to build the network around existing real-world relationships. A federation network can still grow large, but it can do so through those relationships rather than giving institutional bad actors as easy of an entrance as anyone else.

2 more replies

golemotron2y ago

> Each of these transitions caused big problems. None of these problems have ever been completely solved. But each time we found mitigations that limit the impact of any misuse.

This a problem with all technology. The mitigations are like technical debt but with a difference. You can fix technical debt. Short of societal collapse mitigations persist, the impacts ratchet upward and disproportionately affect people at the margin.

There's an old (not quite joke) that if civilization fell, a large percentage of the population would die of the effects of tooth decay.

fullstackchris2y ago

Sure, all tech has 'real' effects. It's kinda the definition of tech. But all of these concerns more or less fall into the category of "add it to the list of things you have to watch out for living in the 21st century" - to me, this is nothing crazy (yet)

The nature of this tech itself is probably what is getting most people - it looks, sounds and feels _human_ - it's very relatable and easy for a non-tech person to understand it and thus get creeped out. I'd argue there are _far_ more dangerous technologies out there, but no one notices and / or cares because they don't understand the tech in the first place!

jdietrich2y ago

>to me, this is nothing crazy (yet)

The "yet" is carrying a lot of weight in that statement. It is now five years since the launch of GPT-2, three years since the launch of GPT-3 and less than 18 months since the launch of ChatGPT. I cannot think of any technology that has improved so much in such a short space of time.

We might hit an inflection point and see that rate of improvement stall, but we might not; we're not really sure where that point might lie, because there's likely to still be a reasonable amount of low-hanging fruit regarding algorithmic and hardware efficiency. If OpenAI and their peers can maintain a reasonable rate of improvement for just a few more years, then we're looking at a truly transformational technology, something like the internet that will have vast repercussions that we can't begin to predict.

The whole LLM thing might be a nothingburger, but how much are we willing to gamble on that outcome?

1 more reply

t4ng0pwn3d2y ago

If you get off the internet you'd not even realise these tools exists though. And for the statement that all jobs will be modelled to be true, it'd have to be impacting the real world.

ben_w2y ago

Is it even possible to "get off the internet" without also leaving civilisation in general at this point?

> it'd have to be impacting the real world

By writing business plans? Getting lawyers punished because they didn't realise that "passes bar exam" isn't the same as "can be relied on for citations"? By defrauding people with synthesised conversations using stolen voices? By automating and personalising propaganda?

Or does it only count when it's guiding a robot that's not merely a tech demo?

1 more reply

Cyphase2y ago

If you get away from roads you wouldn't realize engines exist. Also, the internet is (part of) the real world.

1 more reply

knallfrosch2y ago

Capabilities aren't the problem, cultural adoption is. Just yesterday I talked to someone who still googles solutions to their Excel table woes. Didn't they know of Copilot?

Maybe they didn't know, maybe none of their colleagues used it, their company didn't pay for it, or maybe all they need is an Excel update.

But I am confident that using Copilot would be faster than clicking through the sludge that are Microsoft Office help pages (third party or not.)

So I think it is correct to fear capabilities, even if the real world impace is still missing. When you invent an airplane, there won't be an airstrip to land on yet. Is it useless, won't it change anything?

2 more replies

lobochrome2y ago

HN comments, too. Long, grammatically perfect comments that sound hollow and a bit lengthy are everywhere now.

It's still early, and I don't see much in corporate communications, for instance, but it will be quite the change.

dri_ft2y ago

>Long, grammatically perfect comments that sound hollow and a bit lengthy

It's worse than I thought. They've already managed to mimick the median HN user perfectly!

myspy2y ago

No problem, I'm here to keep the language sophistication level low.

1 more reply

ff102y ago

I tried to make ChatGPT generate a counterpoint for that but it turns out you're right.

schmorptron2y ago

Yes. The old heuristics of if something is generated by grammar and sentence structure don't work as well anymore. The thing that fucks me up the most about it is that I now constantly have to be uncertain about whether something is human or not. Of course, you've always had to be careful about misinformation on the internet, but this raises the scalability of false, hollow, and harmful output to new levels. Especially if it's a topic I'm trying to learn about by reading random articles (or comments), there isn't much of a frame of reference to what's good info and what's hallucinated garbage.

I fear that at some point the anonymity that made the internet great in the first place will be destroyed by this.

2 more replies

neves2y ago

I'm a non native English speaker. Edge new feature of automatically improving my text is a God send. Unfortunately it is blocked at work.

vmfunction2y ago

Many business doesn't want to send their data to a third party such as OpenAI, so until locally run LLM becomes wildly available in businesses.

red-iron-pine2y ago

as the meme goes "always has been"

i remember seeing the change when GPT-2 was announced

fsloth2y ago

We’ve reached a stage, where it would be advisable to not release recent photos of yourself, nor any video with sound clips to public, unless you want an AI fake instaperson of yourself starting to reach out to member of your externally visible social network, asking for money, emergency help, etc.

I guess we need to have an AI secretary to take in all phonecalls from now on (spam folder will become a lot more interesting with celebrity phone calls, your dead relative phoning you etc)

smokel2y ago

Hopefully, we will soon enter the stage where nobody believes anything they see anymore. Then, you no longer have to be afraid of being misinterpreted, because nobody is listening anymore anyway. Great time to be alive!

5 more replies

PeterisP2y ago

I think for most people it's far too late, as there exists at least something on the internet and that something is sufficient - photos can be aged virtually and a single photo is enough, voice doesn't change much and you need only a tiny sample, etc.

And that's the case even if you've never ever posted anything on your social media - it could be family&friends, or employer, or if you're ever been in a public-facing job position that has ever done any community outreach, or ever done a public performance with your music or another hobby, or if you've ever walked past a news crew asking questions to bystanders of some event, or if you've ever participated in some contests or competitions or sports leagues, etc, all of that is generally findable in various archives.

1 more reply

visarga2y ago

> I guess we need to have an AI secretary to take in all phonecalls

Why not an AI assistant in the browser to fend all the adversarial manipulation and spam AIs on the web? Going online without your AI assistant would be like venturing without a mask during COVID

I foresee a cat-and-mouse game, AIs for manipulation vs AIs for protection one upping each other. It will be like immune system vs viruses.

sethammons2y ago

I'm paranoid enough that I now modulate my voice and speak differently when answering an unknown phone call just in case they are recording and building a model to call back a loved one later. If they do get a call, they will be like, "why are you talking like that?"

3 more replies

red-iron-pine2y ago

> Social media is absolutely lousy with AI-powered fraud of varying degrees of sophistication.

has been for years mon ami. i remember when they started talking about GPT-2 here, and then seeing a sea-change in places like reddit and quora

quite visible on HN, esp. in certain threads like those involving brands that market heavily, or discussions of particular countries and politics.

infinitezest2y ago

People were already killing each other for thousands of years so introducing tanks was no big deal, I guess. To say nothing of nuclear weapons.

idle_zealot2y ago

What does abating that trend look like? Most AI safety proposals I hear fall into the categories of a) we need to stop developing this technology or b) we need laws that entrench the richest and most powerful organizations in the world as the sole proprietors of this technology. Neighther of those actually sound better than people being paranoid weirdos about trusting text/video/voice. I think that's kinda where we need to be as a culture: these things are not trustworthy, they were only ever good as a rough heuristic, and now that ship has sailed. We have just finished a transition to treating the digital world as part of our "real" world, but it's time to step that back. Using the internet to interact with known trusted parties will still work fine, provided that some authentication can be shared out-of-band offline. Meeting people and discovering businesses and such? There will be more fakes and scams than real opportunities by orders of magnitude, and as technology progresses our filtering will only get worse. We need to roll back to "don't trust anything online, don't share your identity or payment information online" outside of, as mentioned, out-of-band verified parties. You can still message your friends and family, do online banking and commerce, but you can't initiate a relationship with a person or business online without some kind of trusted recommendation.

jdietrich2y ago

>What does abating that trend look like?

I don't think anyone has a good answer to that question, which is the problem in a nutshell. Job one is to start investing seriously in finding possible answers.

>We need to roll back to "don't trust anything online, don't share your identity or payment information online"

That's easy to say, but it's a trillion-dollar decision. Alphabet and Meta are both worthless in that scenario, because ~all of their revenue comes from connecting unfamiliar sellers with buyers. Amazon is at existential risk. The collapse of Alibaba would have a devastating impact on Chinese exporters, with massive consequent geopolitical risks. Rolling back to the internet of old means rolling back on many years worth of productivity and GDP growth.

2 more replies

pixl972y ago

Trust is more complex then we take credit for.

Even when it comes to people like our parents, there are things we would trust them to do, and things that we would not trust them to do. But what happens when you have zero trusted elements in a category?

At the end of the day, the digital world is the real world, not some seperate place 'outside the environment'. Trying to treat digital like it doesn't exist puts you in a dangerous place to be deceived. For example if you're looking for XYZ and you manage to leak this into the digital world, said digital world may manipulate your trusted friends via ads, articles, the social media posts they see on what they think about XYZ before you ask them.

selfmodruntime2y ago

Point a) is just point b) in disguise. You're just swapping companies for governments.

This tech is dangerous, and I'm currently of the opinion that its uses for malicious purposes are far better and more significant than LLM's replacing anyone's jobs. The bullshit asymmetry principle is very incredibly significant for covert ops and asymmetric warfare, and generating convincing misinformation has become basically free overnight.

emporas2y ago

>Regardless of any future impacts on the labour market or any hypothesised X-risks

Discovering an asteroid full of gold, with as much gold as half the earth to put a modest number, would have huge impact to the labour market. Anything conductive like copper, silver, mining jobs would all go away. Also housing would be obsolete as we would all live in golden houses. A huge impact to the housing market, yet it doesn't seem such a bad thing to me.

>We're already at a point where we're counselling elders to ignore late-night messages from people claiming to be a relative in need of an urgent wire transfer.

Anyone can prove their identity, or identities, over the wire, wire-fully or wire-lessly, anything you like. When i did go to university, i was the only one attending the cryptography class, no one else showed up for a boring class like this. I wrote a story about the Electrona Corp in my blog.

What i say to people for at least 2 years now, is that "Remember when governments were not just some cryptographic algorithms?" Yeah, that's gonna change. Cryptography is here to stay, it is not as dead as people think and it's gonna make a huge blast.

myrmidon2y ago

> Discovering an asteroid full of gold, with as much gold as half the earth to put a modest number, would have huge impact

All this would do is crash the gold price. Also note that all the gold at our disposal right now (worldwide) basically fits into a cube with 20m edges (its not as much as you might think).

Gold is not suitable to replace steel as building material (because it has much lower strength and hardness), nor copper/aluminium as conductor (it's a worse conductor than copper and much worse in conductivity/weigth than aluminium). The main technical application short term would be gold plated electrical contacts on every plug and little else...

2 more replies

om82y ago

> What i say to people for at least 2 years now, is that "Remember when governments were not just some cryptographic algorithms?" Yeah, that's gonna change. Cryptography is here to stay, it is not as dead as people think and it's gonna make a huge blast.

The thing about cryptography and government is that it's easy to imagine for a great technology to be adapted on the governmental level because of its greatness. But it is another thing to actually implement it. We live in a bubble, where almost anyone knows about cryptographic hashes and RSA, but for most of the people it is not the case.

Another thing is that political actors are tending to try to concentrate power in their own hands. No way they will delegate a decision making to any form of algorithm — being cryptographic or not.

1 more reply

horns4lyfe2y ago

A lot of these are non-AI problems. People trying to defraud the elderly need to be taken out back and shot, that’s not an AI issue.

pixl972y ago

Right, I'll just get right on a plane and travel to whereverthefuckville overseas and ask for permission to face blast the scammers. The same scammers that are donating a lot of money to their local (probably very poor) law enforcement to keep their criminal enterprise quite. This will go well.

visarga2y ago

> I'm not confident that I'd be able to distinguish GPT 4o from a human speaker

Probably why it's not released yet. It's unsafe for phishing.

2OEH8eoCRo02y ago

I think people are dismissive for a few reasons.

- It helps them sleep at night if their creation doesn't put millions of people out of work.

- Fear of regulation

miohtama2y ago

> What defences do we have when an LLM will be able to have a completely fluent, natural-sounding conversation in someone else's voice?

The world learnt to deal with Nigerian Prince emails and nobody is falling to those anymore. Nothing was changed - no new laws or regulations needed.

Phishing calls have been going on without an AI for decades.

You can be skeptical and call back. If you know your friends or family you should be able to find an alternative way to get in touch always without too much effort in the modern connected world.

Just recently a gang in Spain was arrested for "son in trouble" scam. No AI used. Most of the parents are not fooled in this.

https://www.bbc.com/news/world-europe-68931214

The AI might have some marginal impact, but it does not matter in the big picture of scams. While it is worrisome, it is not a true safety concern.

fhe2y ago

> yet the real-world impacts remain modest so far.

I second that. I remember when Google search first came out. Within a few days it completely changed my workflow, how I use the Internet, my reading habits. It easily 5 ~ 10x the value of Internet for me over a couple of weeks.

LLMs is doing nothing of the sort for me.

sethammons2y ago

Google was a step function, a complete leveling up in terms of usability of returned data.

ChatGPT does this again for me. I am routinely getting zero useful results on the first page or two of Google searches, but AI is answering or giving me guidance quickly.

Maybe this would not seem such an improvement if Google's results were like they were 10 years ago and not barely usable blogspam

short_sells_poo2y ago

> I am routinely getting zero useful results on the first page or two of Google searches, but AI is answering or giving me guidance quickly.

To me, this just sounds like Google Search has become shit, and since Google simply isn't going to give up the precious ad $$$ that the current format is generating, the next best thing is ChatGPT. But this is different from saying that ChatGPT is a similar step up like Search was.

For what it's worth, I agree with you that Google Search has become unusable. Google basically destroyed it's best product (for users), by turning it into an ad riddles shovelware cesspit.

That ChatGPT is similarly good like Google Search used to be, is a tragedy. Basically we had a conceptually simple product that functioned very well, and we are replacing it with a significantly more complex product.

xanderlewis2y ago

What are you searching for? I see people complaining about this a lot but they never give examples. Google is chock full of spam, yes, but it still works for me.

iamacyborg2y ago

Google’s results are themselves an AI product though. You’re just comparing different AIs.

palad1n2y ago

OMG I remember trying Google when it was in beta, and HOLY CRAP what I had been using was like freakin night and day. AltaVista: remember that? That was the state of the art before that, and it did not compare. Night and day.

wenc2y ago

I remember Google being marginally better than Altavista but not much more.

The cool kids in those days used Metacrawler, which meta searched all the search engines.

6 more replies

Vinnl2y ago

And I'm sure that it's doing that for some people, but... I think those are mostly in the industry. For most of the people outside the tech bubble, I think the most noticeable impact it has had on their lives so far is that they've seen it being talked about on the news, maybe tried ChatGPT once.

That's not to say it won't have more significant impact in the future; I wouldn't know. But so far, I've yet to see the hype get realised.

RhodesianHunter2y ago

>LLMs is doing nothing of the sort for me.

Don't use it for things you're already an expert in, it can't compare to you yet.

Use it for learning new things, or for things you aren't very good at and don't want to bother with. For these it's incredible.

XCSme2y ago

For me, LLMs mostly replaced search. I run local Ollama, and whenever I need help with coding/docs/examples, I just ask Mixtral7x8B, and get an answer instantly, tailored to my needs.

ben_w2y ago

> OpenAI are masters of hype. They have been generating hype for years now, yet the real-world impacts remain modest so far.

Perhaps.

> Do you remember when they teased GPT-2 as "too dangerous" for public access? I do. Yet we now have Llama 3 in the wild, which even at the smaller 8B size is about as powerful as the [edit: 6/13/23] GPT-4 release.

The statement was rather more prosaic and less surprising; are you sure it's OpenAI (rather than say all the AI fans and the press) who are hyping?

"""This decision, as well as our discussion of it, is an experiment: while we are not sure that it is the right decision today, we believe that the AI community will eventually need to tackle the issue of publication norms in a thoughtful way in certain research areas.

…

We are aware that some researchers have the technical capacity to reproduce and open source our results. We believe our release strategy limits the initial set of organizations who may choose to do this, and gives the AI community more time to have a discussion about the implications of such systems."""

anon3738392y ago

That's fair: the statement isn't hyperbolic in its language. But remember that GPT-2 was barely coherent. In making this statement, I would argue that OpenAI was trying to impart a sense of awe and danger designed to attract the kind of attention that it did. I would argue that they have repeatedly invoked danger to impart a sense of momentousness to their products. (And to further what is now a pretty transparent effort to monopolize the tech through regulatory intervention.)

ben_w2y ago

> (And to further what is now a pretty transparent effort to monopolize the tech through regulatory intervention.)

I disagree here also: the company has openly acknowledged that this is a risk to be avoided with regards to safety related legislation, what they've called for looks a lot more like "we don't want a prisoner's dilemma that drives everyone to go fast at the expense of safety" rather than "we're good everyone else is bad".

lynx232y ago

> yet the real-world impacts remain modest so far.

I spend a part of yesterday evening sorting my freshly dried t-shirts into 4 distinct piles. I used OpenAI Vision (through BeMyEyes) from my phone. I got a clear description of each and every piece of clothing, including print, colours and brand. I am blind BTW. But I guess you are right, no impact at all.

> Yet we now have Llama 3 in the wild

Yes, great, THANKS Meta, now the Scammers have something to work with. Thats a wonderful achievement which should be praised! </sarcasm>

anon3738392y ago

> I got a clear description of each and every piece of clothing, including print, colours and brand. I am blind BTW.

That is a really great application of this tech. And definitely qualifies as real-world impact. Thanks for sharing that!

keiferski2y ago

I can’t even get GPT 4 to reliably take a list of data and put it in a CSV. It gets a problem every single time.

People read too many sci-fi books and then project their fantasies on to real-world technologies. This stuff is incredibly powerful and will have social effects, but it’s not going to replace every single job by next year.

mFixman2y ago

GPT-4 is better at planning than at executing.

Have you tried asking it to generate a regex to transform your list into a CSV?

Etherlord872y ago

I remember when people used to argue about regex being bad or good, with a lot of low quality regex introducing bugs in codebases.

Now we have devs asking AI to generate regex formulas and pasting it into code without much concern on its validity.

1 more reply

keiferski2y ago

No, I’ll give that a shot. I have just been asking it to convert output into a CSV, which used to work somewhat well. It stumbles when there is more complexity though.

1 more reply

jstummbillig2y ago

> Do you remember when they teased GPT-2 as "too dangerous" for public access? I do.

I can't help but notice the huge amount of hindsight and bad faith that it demonstrated here. Yes, now we are aware that the internet did not drown in a flood of bullshit (well, not noticeably more), when GPT-2 was released.

But was it obvious? I certainly thought that there was a chance that the amount of blog spam that could be generated effortlessly might just make internet search unusable. You are declaring "hype", when you could also say "very uncertain and conscientious". Is this not something we want people in charge to be careful with?

pixl972y ago

I think the problem is, we did drown in a flood of bullshit, but we've just somehow missed it.

Even in this thread people talk about "Oh I use ChatGPT rather than Google search because Google is just stuffed with shit". And on HN there are plenty of discussions about huge portion of reddit threads being regurgitated older comments.

kenjackson2y ago

GPT-4 already seems better at reasoning than most people. It just has an unusual training domain of Internet text.

havercosine2y ago

I was going to say the same thing. For some real world estimation tasks where I don't want 100% accuracy (example: analysing working capital of a business based on balance sheet, analysing some images and estimating inventory etc.) the job done by GPT-4o is better than fresh MBA graduates from tier 2/tier 3 cities in my part of world.

Job seekers currently in college have no idea what is about to hit them in 3-5 years.

selfmodruntime2y ago

I agree. HN's and the tech bubble's bias many people are not noticing is that it's full of engineers comparing GPT-4 to software engineering tasks. In programming, the margin of error is incredibly slim in the way that a compiler either accepts entirely correct code (in its syntax of course) or rejects it. There is no in between, and verifying software to be correct is hard.

In any other industry where just need an average margin of error close to a human's work and verification is much easier than generating possible outputs, the market will change drastically.

1 more reply

jahnu2y ago

I’d love to see this! Can you give us a couple of concrete examples of this that we can check?

golol2y ago

not really. Even a human bad at reasoning can take 1 hour of time to tinker around and figure things out. GPT-4 just does not have the deep planning/reasoning ability necessary for that.

theshrike792y ago

Have you seen some people with technology? =)

They won't "take 1 hour of time", they try it once or twice and give up.

lynx232y ago

I think you might be falling for selection bias. I guess you are surrounding yourself with a lot of smart people. "tinker around and figure things out" is definitely something certain humans (bad at reasoning) can't do. I already prefer the vision model when it comes to asking for a picture description (blind user) over many humans I personally know. The machine is usually more detailed, and takes the time to read the text, instead of trying to shortcut and decide for me whats important. Besides, people from the english speaking countries do not have to deal with foreign languages. Everyone else has to. "Aber das ist ja in englisch" is a common blocker for consuming information around here. I tell you, if we dont manage to ramp up education a few notches, we'll end up with even higher stddev when it comes to practical intelligence. We already have perfectly normal seeming humans absolutely unable to participate on the internet.

trashtester2y ago

Reasoning and planning are different things. It's certainly getting quite good at deductive reasoning, especially when forced to check it's own arguments for flaws every time it states something. (I had a several hour chat with it yesterday, and I was very impressed about the progress.)

Planning is different in that it is an essential part of agency. That's what Q* is supposed to add. My guess is that planning is the next type of functionality to be added to GPT. I wouldn't be surprised if they already have a version internally with such functionality, but that they've decided to hold it back for now for reasons such as safety (some may care about the election this year) or simply that the inference costs are so huge they cannot possibly expose it publicly.

__MatrixMan__2y ago

Does it need those things if it can just tap into artifacts generated by humans who did spend that hour?

1 more reply

bamboozled2y ago

If everyone is average at reasoning then it must not be a very important trait or we’d all be at reasoning school getting better at it.

Really philosophy seems to be one of the least important subjects right now. Hardly anyone learns about it in school.

If it was so important to success in the wild than it would stand to reason we all work hard at improving our reasoning skills, but very few do.

ben_w2y ago

What schools teach is what governments who set the curriculum like to think is important, which is why my English lessons had a whole section on the Shakespearean (400-year-old, English, Christian) take on the life and motivations of a Jewish merchant living in Venice, followed up with a 80 year old (at the time) English poem on exactly how bad it is to watch your friends choke to death as their lungs melt from chlorine gas in the trenches of the first world war.

These did not provide useful life-lessons for me.

(The philosophy A-level I did voluntarily seemed to be 50% "can you find the flaws in this supposed proof of the existence of god?")

2 more replies

owenpalmer2y ago

They're masters of hype because their products speak for themselves (literally)

famouswaffles2y ago

Yeah. Open ai are certainly not masters of hype lol. They released their titular product to basically no fanfare or advertisement. ChatGPT took off on Word of Mouth alone. They dropped GPT-4 without warning and waited months to ship it's most exciting new feature (image input).

Even now, they're shipping text-image 4o but not the new voice while leaving old-voice up and confusing/disappointing a whole lot of people. This is a pretty big marketing blunder.

fullstackchris2y ago

> ChatGPT took off on Word of Mouth alone.

I remember for a good 2-3 months in 2023 ALL you could see on tiktok / youtube shorts was just garbage about 'how amazing' ChatGPT was. Like - video after video and I was surprised of the repeat content being recommended to me... No doubt openAI (or something) was behind that huge marketing push

6 more replies

danielscrubs2y ago

"real-world impacts remain modest so far." Really? My Google usage has went down with 90% (it would just lead me to some really bad take from a journalist anyway, while ChatGPT can just hand me the latest research and knows my level of expertise). Sure it is not so helpful at work, but if OpenAI hasnt impacted the world I fail to see which company have in this decade.

kortilla2y ago

“Replaced Google” is definitely an impact, but it’s nothing compared to the people that were claiming entire industries would be wiped out nearly overnight (programming, screenwriting, live support, etc).

jdietrich2y ago

Speak to some illustrators or voiceover artists - they're talking in very bleak terms about their future, because so many of them are literally being told by clients that their services are no longer required due to AI. A double-digit reduction in demand is manageable on aggregate, but it's devastating at the margin. White-collar workers having to drive Ubers or deliver packages because their jobs have been taken over by AI is no longer a hypothetical.

1 more reply

monk_e_boy2y ago

colleges are seeing apprentices placements drop - why train an apprentice for two years when ChatGPT will do the work for them?

1 more reply

mewpmewp22y ago

I think mostly claims have been around multiplying the efforts of people for now.

Sabinus2y ago

If Google hadn't ruined Search to help Advertising perhaps it wouldn't have been such a stark comparison in information quality.

d1sxeyes2y ago

Search was always a byproduct of Advertising. Don’t blame Google for sticking to their business model.

We were naive to think we could have nice things for free.

2 more replies

tux19682y ago

It will be interesting to see how they compare, years from now, when ChatGPT has been similarly compromised.

1 more reply

selfmodruntime2y ago

There is little other way of making money from search.

anon3738392y ago

I believe you, and I do turn to an LLM over Google for some queries where I'm not concerned about hallucination. (I use Llama 3 most of the time, because the privacy is absolute.)

But OpenAI is having a hard time retaining/increasing ChatGPT users. Also, Alphabet's stock is about as valuable as it's ever been. So I don't think we have evidence that this is really challenging Google's search dominance.

danielscrubs2y ago

Google is an ad company. Ad prices are on an auction and most companies believe that they need ads. Less customers don't necessarily mean that the earnings go down, as when the clicks go down the prices might go up (without ad competitors). Ergo, they don't compete (yet at least).

But ChatGPT has really hurt Google's brand image.

osigurdson2y ago

Ironically, I was like that for a while, but now use regular google search again quite a bit. A lot of times, good old stack overflow is best.

bobviolier2y ago

The questions I ask ChatGPT have (almost) no monetary value for Google (programming, math, etc).

The questions I still ask Google, have a lot of monetary value (restaurants, cloths, movie, etc).

galaxyLogic2y ago

I use Google and it gives me AI answers.

But I agree seems SO often helps more than Google-AI.

doug_durham2y ago

It's well known that LLMs don't reason. That's not what they are for. It's a throw away comment to say that a product can't do what it explicitly is unable to do. Reasoning will require different architectures. Even with that LLMs are incredibly useful.

collyw2y ago

Chat GPT 3.5 has been neutered, as it it won't spit out anything that isn't overly politically correct. 4chan were hacking their way around it. Maybe that's why they decided it was "too dangerous".

FrustratedMonky2y ago

" GPT-4o doesn't show much of an improvement in "reasoning" strength."

Maybe that is GPT-5.

And this release really is just incremental improvements in speed, and tying together a few different existing features.

koonsolo2y ago

> yet the real-world impacts remain modest so far

Go ask any teacher or graphician.

denvrede2y ago

That's one of my biggest fears, teachers using AI generated content without "checks" to raise / teach / test our children.

KronisLV2y ago

> Do you remember when they teased GPT-2 as "too dangerous" for public access? I do.

Maybe not GPT-2, but in general LLMs and other generative AI types aren't without their downsides.

From companies looking to downsize their staff to replace them with software, to the work of artists/writers being devalued somewhat, to even easier scams and something like the rise of AI girlfriends, which has also gotten some critique, some of those can probably be a net negative.

Even when it's not pearl clutching over the advancements in technology and the social changes that arise, I do wonder how much my own development work will be devalued due to the somewhat lowered entry barrier into the industry and people looking for quick cash, same as with boot camps leading to more saturation. Probably not my position individually (not exactly entry level), but the market as a whole.

It's kind of at a point where I use LLMs for dev work not to fall behind, cause the productivity gains for simple problems and boilerplate are hard to argue with.

naasking2y ago

> They have been generating hype for years now, yet the real-world impacts remain modest so far.

I feel like everyone who makes this claim doesn't actually have any data to backup it up.

somenameforme2y ago

Like another comment mentioned, sigmoid curves [1] are ubiquitous with neural network systems. Neural network systems can be intoxicating because it's so "easy" (relatively speaking) to go from nothing to 80% in extremely short periods of time. And so it seems completely obvious that hitting 100% is imminent. Yet it turns out that each percent afterwards starts coming exponentially more slowly, and we tend to just bump into seemingly impassable asymptotes far from where we'd like to be.

~8 years ago when self driving technology was all the rage and every major company was getting on board with ever more impressive technological demos, it seemed entirely reasonable to expect that we'd all be in a world of complete self driving imminently. I remember mocking somebody online around the time who was pursuing a class C/commercial trucking license. Yet now a decade later, there are more truckers than ever and the tech itself seems further away than ever before. And that's because most have now accepted that progress on such has basically stalled out in spite of absolutely monumental efforts at moving forward.

So long as LLMs regularly hallucinate, they're not going to be useful for much other than tasks that can accept relatively high rates of failure. And many of those generally creative domains are the ones LLMs are paradoxically the weakest in - like writing. Reading a book written by an LLM would be cruel and unusual punishment given then current state of the art. One domain I do see them completely taking over is search. They work excellently as natural language search engines, and "failure" in such is very poorly defined.

[1] - https://en.wikipedia.org/wiki/Sigmoid_function

mlsu2y ago

I'm not really sure your self-driving analogy is apt here. Waymo has cars on the road right now that are totally autonomous, and just expanded its footprint. It has been longer and more difficult than we all thought, and those early tech demos were a glimmer of what was to come; then we had to grind to get there, with a lot of engineering.

I think what maybe seems not obvious amidst the hype is that there is a hell of a lot of engineering left to do. The fact that you can squash the weights of a neural net down to 3 bits per param and it still works -- is evidence that we have quite a way to go with maturing this technology. Multimodality, improvements to the UX of it, the human-computer interface part of it. Those are fundamental tech things, but they are foremost engineering problems. Getting latency down. Getting efficiency up. Designing the experience, then building it out.

25 years ago, early tech demos on the internet were promising that everyone would do their shopping, entertainment, socializing, etc... online. Breathless hype. 5 years after that, the whole thing crashed, but it never went away. People just needed time to figure out how to use it and what it was useful for, and discover its limitations. 10 years after that, engineering efforts were systematized and applied against the difficult problems that still remained. And now: look at where we are. It just took time.

achierius2y ago

I don't think he's saying that AGI is impossible — almost noone (nowadays) would suggest that it's anything but an engineering challenge. The argument is simply one of scale, i.e. how long that engineering challenge will take to solve. Some people are suggesting on the order of years. I think they're suggesting it'll be closer to decades, if that.

abenga2y ago

AGI being "just an engineering challenge" implies that it is conceptually solved, and we need only figure out how to build it economically.

It most definitely is not.

saberience2y ago

Waymo cars are highly geofenced in areas with good weather and good quality roads. They only just (in January) gained the capability to drive on freeways.

Let me know when you can get a Waymo to drive you from New York to Montreal in winter.

naasking2y ago

> Waymo cars are highly geofenced in areas with good weather and good quality roads. They only just (in January) gained the capability to drive on freeways

They are an existence proof that the original claim that we seem further than ever before is just wrong.

2 more replies

ux-app2y ago

Why do some people gloat about moving goalposts around?

15 years ago self driving of any sort was pure fantasy, yet here we are.

They'll release a version that can drive in poor weather and you'll complain that it can't drive in a tornado.

2 more replies

rossant2y ago

It's been 8 years and I still don't have my autonomous car.

Meanwhile I've been using ChatGPT at work for _more than a year_ and it's been tremendously helpful to me.

This is not hype, this is not about how AI will change our lives in the future. It's there right here, right now.

somenameforme2y ago

Of course. It's quite a handy tool. I love using it for searching documentation for some function that I know the behavior of, but not the name. And similarly, people have been using auto-steer, auto-park, and all these other little 'self driving adjacent' features for years as well. Those are also extremely handy. But the question is, what comes next?

The person I originally responded to stated, "We’re moving toward a world where every job will be modeled, and you’ll either be an AI owner, a model architect, an agent/hardware engineer, a technician, or just.. training data." And that far less likely than us achieving L5 self driving (if not only because driving is quite simple relative to many of the jobs he envisions AI taking over), yet L5 self driving seems as distant as ever as well.

davedx2y ago

> So long as LLMs regularly hallucinate, they're not going to be useful for much other than tasks that can accept relatively high rates of failure.

Yep. So basically they're useful for a vast, immense range of tasks today.

Some things they're not suited for. For example, I've been working on a system to extract certain financial "facts" across SEC filings. ChatGPT has not been helpful at all either with designing or implementing (except to give some broad, obvious hints about things like regular expressions), nor would it be useful if it was used for the actual automation.

But for many, many other tasks -- like design, architecture, brainstorming, marketing, sales, summarisation, step by step thinking through all sorts of processes, it's extremely valuable today. My list of ChatGPT sessions is so long already and I can't imagine life without it now. Going back to Google and random Quora/StackOverflow answers laced with adtech everywhere...

tcgv2y ago

> I've been working on a system to extract certain financial "facts" across SEC filings. ChatGPT has not been helpful at all

The other day, I saw a demo from a startup (don't remember their name) that uses generative AI to perform financial analysis. The demo showed their AI-powered app basically performing a Google search for some companies, loosely interpreting those Google Stock Market Widgets that are presented in such searches, and then fetching recent news and summarizing them with AI, trying to extract some macro trends.

People were all hyped up about it, saying it will replace financial analysts in no time. From my point of view, that demo is orders of magnitude below the capacity of a single intern who receives the same task.

In short, I have the same perception as you. People are throwing generative AI into everything they can with high expectations, without doing any kind of basic homework to understand its strengths and weaknesses.

jstummbillig2y ago

> So long as LLMs regularly hallucinate, they're not going to be useful for much other than tasks that can accept relatively high rates of failure.

But is this not what humans do, universally? We are certainly good at hiding it – and we are all good at coping with it – but my general sense when interacting with society is that there is a large amount of nonsense generated by humans that our systems must and do already have enormous flexibility for.

My sense is that's not an aspect of LLMs we should have any trouble with incorporating smoothly, just by adhering to the safety nets that we built in response to our own deficiencies.

SubiculumCode2y ago

The sigmoid is true in humans too. You can get 80% of the way to being sort of good at a thing in a couple of weeks, but then you hit the plateau. In a lot of fields confidently knowing and applying this has made people local jack of all trades experts... the person that often knows how to solve the problem. But Jack is no longer needed so much. ChatJack got`s your back. Better to be a the person who knows one thing in excruciating detail and depth, and never ever let anyone watch you work or train on your output.

nox1012y ago

I think it's more like an exponential curve where it looks flat moments before it shoots up.

mapping th genome was that way. On a 20yr schedule, barely any progress for 15 and then poof, done ahead of schedule

whilenot-dev2y ago

> or just.. training data.

I have a much less "utopian" view about the future. I remember during the renaissance of neural networks (ca. 2010-15) it was said that "more data leads to better models", and that was at a time when researchers frowned upon the term Artificial Intelligence and would rather use Machine Learning. Fast forward a decade LLMs are very good synthetic data generators that try to mimic human generated input and I can't think somehow that this wasn't the sole initial intent of LLMs. And that's it for me. There's not much to hype and no intelligence at all.

What happens now is that human generated input becomes more valuable and every online platform (including minor ones) will have now some form of gatekeeping in place, rather sooner than later. Besides that a lot of work still can't be done in front of a computer in isolation and probably never will, and even if so, automation is not a means to an end. We still don't know how to measure a lot of things and much less how to capture everything as data vectors.

Aeolun2y ago

The two AI’s talking to each other was like listening to two commercials talking to each other. Like a callcenter menu that you cannot skip. And they _kept repeating themselves_. Ugh. If this is the future I’m going to hide in a cave.

aussieguy12342y ago

My new PC arrives tomorrow. Once I source myself two RTX 3060's I'll be an AI owner, no longer dependant on cloud APIs.

Currently the bottleneck is Agents. If you want a large language model to actually do anything you need an Agent. Agents so far need a human in the loop to keep them sane. Until that problem is solved most human jobs are still safe.

trashtester2y ago

GPT 4o incorporated multimodality directly in the neural network, while reducing inference costs to half.

I fully expect GPT 5 (or at the latest 6) to similarly have native inclusion of agentic capabilities either this year or next year, assuming it doesn't already, but is just kept from the public.

bamboozled2y ago

Going to put the economy in a very, very weird situation if true.

Will be like, the end of millions of careers overnight.

It will probably strongly favour places like China and Russia though, where the economy is already strongly reliant on central control.

trashtester2y ago

> It will probably strongly favour places like China and Russia though, where the economy is already strongly reliant on central control.

I think you may be literally right in the opposite sense to what I think you intended.

China (and maybe Russia) may be able to use central control to have an advantage when it comes to avoiding disasterous outcomes.

But when it comes to the rate of innovation, the US may have an advantage for the usual reasons. Less government intervention (due to lobbyism) combined with having several corporations actively competing with each other to be first/best usually leads to faster innovation. However, the downside may be the it also introduces a lot more risk.

ilaksh2y ago

Agentic capability just means it outputs a function call which it has had for a long time.

trashtester2y ago

That's a very weak form. The way I use "agentic" is that it is trained to optimize the success of an agent, not just predict the next token.

The obvious way to to that is for it to plan a set of actions and evalute each possible way to reach some goal (or avoid an anti-goal). Kind of what AlphaZeros is doing for games. Q* is rumored to be a generalization of this.

selfhoster112y ago

You are far better off investing in one or more 3090s and loading up on DDR RAM.

sambazi2y ago

> Agents so far need a human in the loop to keep them sane.

not quite sure that sanity is a business requirement

aussieguy12342y ago

Yes, but to use a car dealership example, you don't want your Agent to sell a car to someone for $1 https://hothardware.com/news/car-dealerships-chatgpt-goes-aw...

mottiden2y ago

> We’re moving toward a world where every job will be modeled, and you’ll either be an AI owner, a model architect, an agent/hardware engineer, a technician, or just.. training data.

I understand that you might be afraid. I believe that a world where only LLM companies rule the world is not practically achievable except in some distopian universe. The likelihood of the world where the only job are model architects, engineers or technicians is very very small.

Instead, let's consider the positive possibilities that LLMs can bring. It can lead to new and exciting opportunities across various fields. For instance, can serve as a tool to inspire new ideas for writers, artists, and musicians.

I think we are going towards a more collaborative era where computers and humans interact much more. Everything will be a remix :)

altcognito2y ago

> The likelihood of the world where the only job are model architects, engineers or technicians is very very small.

Oh, especially since it will be a priority to automate their jobs, or somehow optimize them with an algorithm because that's a self-reinforcing improvement scheme that would give you a huge edge.

SubiculumCode2y ago

Every corporate workplace is already thinking: How can I surveil and record everything an employee does as training data for their replacement in 3 years time.

mycall2y ago

> Can it do math yet?

GPT-4? Not that well. AI? Definitely

https://deepmind.google/discover/blog/alphageometry-an-olymp...

schnitzelstoat2y ago

Until the hallucination problem is solved, the output can't be trusted.

So outside of use-cases where the user can quickly verify the result (like picking a decent generated image etc.),I can't see it being used much.

zarathustreal2y ago

Never heard of retrieval-augmented generation?

usrbinbash2y ago

RAG? Sure. I even implemented systems using it, and enabling it, myself.

And guess what: RAG doesn't prevent hallucination. It can reduce it, and there are most certainly areas where it is incredibly useful (I should know, because that's what earns my paycheck), but it's useful despite still hallucinations being a thing, not because we solved that problem.

zarathustreal2y ago

Are you implying that you’re the same person I was commenting to or are you just throwing your opinion into the mix?

Regardless, we’ve seen accuracy of ~98% with simple context-based prompting across every category of generation task. Don’t take my word for it, a simple search would show the effectiveness of “n-shot” prompting. Framing it as “it _can_ reduce” hallucinations is disingenuous at best, there really is no debate about how well it works. We can disagree on whether 98% accuracy is a solution but again I’d assert that for >50% of all possible real world uses for an LLM 98% is acceptable and thus the problem can be colloquially referred to as solved.

If you’re placing the bar at 100% hallucination-free accuracy then I’ve got some bad news to tell you about the accuracy of the floating point operations we run the world on

visarga2y ago

> Can it just do my entire job for me?

All AIs up to now lack autonomy. So I'd say until we crack this problem, it is not going to be able to do your job. Autonomy depends on a kind of data that is iterative, multi-turn, and learning from environments not from static datasets. We have the exact opposite, lots of non-iterative, off-policy (human made AI consumed) text.

p1esk2y ago

This is still gpt4. I don’t expect much more from this version than what previous version could do, in terms of reasoning abilities.

But everyone is expecting them to release gpt5 later this year, and it is a bit scary to think what it will be able to do.

trashtester2y ago

It's quite different from gpt4 in two respects:

1) It's natively multi-modal in a way I don't think gpt4 was.

2) It's at least twice as efficient in terms of compute. Maybe 3 times more efficient, considering the increase in performance.

Combined, those point towards some major breakthroughs having gone into the model. If the quality of the output hasn't gone up THAT much, it's probably because the technological innovations mostly were leveraged (for this version) to reduce costs rather than capabilities.

My guess is that we should expect them to leverage the 2x-3x boost in efficiency in a model that is at least as large as GTP4 relatively soon, probably this year unless OpenAI has safety concerns or something, and keeps it internal-only.

jiggawatts2y ago

Branding aside, this pretty much is GPT 5.

The evidence for that is the change in the tokenizer. The only way to implement that is to re-train the entire base model from scratch. This implies that GPT 4o is not a fine-tuning of GPT 4. It's a new model, with a new tokenizer, new input and output token types, etc...

They could have called it GPT-5 and everyone would have believed them.

p1esk2y ago

I’ve used it for a couple of hours to help with coding and it feels very similar to gpt4: still makes erroneous and inconsistent suggestions. Not calling it 4.5 was the right call. It is much faster though.

The expectations for gpt5 are sky high. I think we will see a similar jump as 3.5 -> 4.

mewpmewp22y ago

Pretty sure they said they would not release GPT-5 on Monday. So it's something else still. And I don't see any sort of jump big enough to label it as 5.

I assume GPT-5 has to be a heavier, more expensive and slower model initially.

GPT-4o is like an optimisation of GPT-4.

adroniser2y ago

That doesn't imply that it's GPT-5. A GPT-4 training run probably doesn't take them that long now they've acquired so many GPUs for training GPT-5.

golol2y ago

I think 4o is actually noticeably smarter than 4, after having tried it a tiny bit on the playground.

jimmySixDOF2y ago

There has been speculation that this is the same mystery model floating around on lmsys chat bot arena and they claim a real observable jump on elo scores but this remains to be seen some people don't think its even as capable as GPT4-Turbo so tbd

hackerlight2y ago

It's a completely new model trained from scratch that they've decided to label that way as part of their marketing strategy.

datahack2y ago

All I could think about when watching this demo was how similar capabilities will work on the battlefield. Coordinated AIs look like they will be obscenely effective.

Everything always starts as a toy.

trashtester2y ago

The "Killer app" for AGI/ASI is, I suspect, going to be in robotics, even more so than in replacing "white collar workers".

That includes, beyond literal Killers, all kinds of manufacturing, construction and service work.

I would expect a LOT of funds to go into research all sorts of actuators, artificial muscles and any other technology that will be useful in building better robots.

Companies that can get and maintain a lead in such technologies may reach a position similar to what US Steel had in the 19th century.

That could be the next nvidia.

I would not be at all surprised if we will have a robot in the house in 10 years that can clean and do the dishes, and that is built using basically the same parts as the robots that replace our soldiers and the police.

Who will ultimately control them, though?

bamboozled2y ago

I would expect a LOT of funds to go into research all sorts of actuators, artificial muscles and any other technology that will be useful in building better robots.

If you had an ASI? I don’t think you’d need a lot of funds to go into this area anymore ? Presumably it would all be solved overnight.

trashtester2y ago

Once we have godlike tier ASI, you're probably right. But I expect that robots could become extremely lucrative even when avaiable AI's haven't reached that point yet.

Companies that have a head start at that point, may get a huge first-mover advantage. Also, those companies also very well may have the capability to leverage AI in product development, just like everyone else.

And just as important as the products themselves is the manufacturing capacity to build them at scale. Until we have massive numbers of robots in service, building such infrastructure is likely to be slow and expensive.

EDIT: Also, once we really have the kind of Godlike ASI you envision, no human actions really matter (economically) anymore.

RugnirViking2y ago

its possible. Right now ai + robotics has been a big area of research for a while, and its very good at some tasks, see basically everything boston dynamics does wrt dynamically balancing. They help alongside control systems very well. However for multimodal task planning its not there. A year or two back I wrote a long comment about it but basically there is this idea of "grounding", basically connecting computer vision, object symbols/concepts, and task planning, which remains elusive. Its a similar problem with self driving cars - you want to be able to reason very strongly about things like "place all of the screws into the red holes" in a way that maps automatically to the actions for those things

trashtester2y ago

Yes. As you say, a lot of the limitations so far has been the control part, which is basically AI.

Given the pace that AI is currently moving at, it seems to me that more and more, the mechanical aspect is becoming the limitation.

GPT 4o now seems to be quite good at reasoning about the world from pictures in real time. I would expect it would soon become easy for it to do the high level part of many practical tasks, from housekeeping to manufacturing or construction. (And of course military tasks.)

This leaves the direct low-level actuator control to execute such tasks in detail. But even there, development has been immense. See for instance these soccer playing robots [1]

And as both high level and low level control (if we assume that models soon will add agentic features directly into the neural networks), the only missing peace is the ability to build mechanically capable and reliable robots at a low enough price that they become cheaper than humans for various kinds of work.

There is one more limitation, of course, which is that GPT 4o still requires a constant connection to a data center, and that the models is too large to run within a device or machine.

This is also one of the most critical limitations of self driving. Had the AI within a Tesla had the same amount of compute available as GPT-4o, it should be massively more capable.

[1] https://www.youtube.com/watch?v=RbyQcCT6890

hwbunny2y ago

And people will become utterly stupid in the process.

CTDOCodebases2y ago

or just... unemployed.

megous2y ago

Why so much positivity? It can also murder people, and it will continue being used for that. That's scary.

onion2k2y ago

(IMO) AI cannot murder people. The responsibility of what an AI does falls on the person who deployed it, and to a lesser extent the person who created it. If someone is killed by a fully autonomous weapon then that person has been murdered by the person or people who created and enabled the AI, not the AI itself.

This is no different to saying a person with a gun murdered someone rather than attributing the murder to the gun. An AI gun is just a really fancy gun.

7bit2y ago

There will come a time where complex systems can better be predicted with the use of AI than with mathematical predictions. One use-case could be, feeding body scans into them for cancer prevention. AFAIK this is already researched.

There may come a time where we grow so accustomed to this, that the decision is so heavily influenced by AI, that we believe it more than human decisions.

And then it can very well kill a human through misdiagnostic.

I think it is important to not just put this thought aside, but to evaluate all risks.

apostata2y ago

> And then it can very well kill a home through misdiagnosis.

I would imagine outcomes would be scrutinized heavily for an application like this. There is a difference between a margin of error (existing with human doctors as well) and a sentient ai that has decided to kill, which is what it sounds like you're describing.

If we didn't give it that goal, how does it obtain it otherwise?

bloqs2y ago

Except that with a gun, you have a binary input (the trigger) so you can squarely blame a human for misunderstanding what they did when they accidentally shot someone on the grounds that the trigger didnt work.

A prompt is a _very_ different matter.

Cthulhu_2y ago

The mass murder of Palestinians is already partially blamed or credited to an "AI" system that could identify people. Humans spent seconds reviewing the outcome. This is the reality of AI already being used to assist in killing. AI can't take the blame legally speaking, but it makes it easier to make the call and sleep at night. "I didn't order a strike on this person and their family of eight, the AI system marked this subject as a high risk, high value target". Computer-assisted dehumanization. (Not even necessarily AI)

latexr2y ago

> This is no different to saying a person with a gun murdered someone rather than attributing the murder to the gun.

And “guns don’t kill people, people kill people”¹ is a bad argument created by the people who benefit from the proliferation of guns, so it’s very weird that you’re using that as if it were a valid argument. It isn’t. It’s baffling anyone still has to make this point: easy access and availability of guns makes them more likely to be used. A gun which does not exist is a gun which cannot be used by a person to murder another.

It’s also worth nothing the exact words of the person you’re responding to (emphasis mine):

> It can also murder people, and it will continue being used for that.

Being used. As in, they’re not saying that AI kills on its own, but that it’s used for it. Presumably by people. Which doesn’t contradict your point.

¹ https://en.wikipedia.org/wiki/Guns_don%27t_kill_people,_peop...

spurgu2y ago

We also choose to have cars, which cause a certain amount of death. It's an acceptable tradeoff (which most don't think about much). I'd speculate that it's mostly people who don't use cars who criticize them the most, and the same with guns.

1 more reply

ithkuil2y ago

Probably it will be used for many things. Just like computers, electricity, iron

mavhc2y ago

Don't need AI to murder people, a gun with an actuator on the trigger can murder people easily, add rnd to it and it can murder people at random

interludead2y ago

Anything (almost) can be used for good and bad

kolinko2y ago

So can knives

Cthulhu_2y ago

Yes, but a person wielding a knife has morals, a conscience and a choice, the fear is that an AI model does not. A lot of killer AI science fiction boils down to "it is optimal and logical that humanity needs to be exterminated"; no morality or conscience involved.

7bit2y ago

Which is why there are laws around what knives are allowed and what are banned. Or how we design knifes to be secure. Or how we have a common understanding what we do with knifes - and what not. Such as not giving them to toddlers... So what's your point?

Cthulhu_2y ago

The point is not the tool but how it's used. "What knives are allowed" is a moot point because a butter knife or letter opener can be used to kill someone.

1 more reply

aredox2y ago

Call me back when we have autonomous driving, and when Bitcoin will replace currencies.

What's scary and cringey are your delusions.

plaidfujiOP2y ago

Don’t get me wrong, I’m not suggesting the current capabilities are anywhere near replacing human productivity. Some things are 1 year out, some 5 (maybe self-driving cars by then? Mercedes has it on their roadmap for 2030 and they’ve historically been realistic), some 10+. But the pieces are in place and the investments are being made. The question is no longer “can AI really automate this?”, it’s “how do we get the dataset that will enable us to automate this with AI?”. And as long as Open AI keeps people’s eyes on their whizbang demos, the money will keep flowing…

aksss2y ago

Or a battery.

Jiahang2y ago

make people smart

starfezzy2y ago

Nature had been doing that for billions of years until a few decades ago when we were told "progress" meant we had to stop doing the same thing more peacefully and intentionally.

My guess is the future belongs to those who don't stop—who, in fact, embrace the opposite of stopping.

I would even suggest that the present belongs to those who didn't stop. It may be too late for normal people to ever catch up by the time we realize the trick that was played on us.

jddj2y ago

The present absolutely belongs to those who didn't stop, but it's been a lot longer than a few decades.

Varying degrees of greedy / restless / hungry / thirsty / lustful are what we've got, because how is contentedness ever going to compete with that over millennia?

slfnflctd2y ago

It just occurred to me that this is one of the core things most successful religions have been trying to do in some form from the time they first arose.

I've had a lot of negative things to say about religion for many years. However, as has been often observed, 'perception is reality' to a certain extent when it affects how people behave, and perhaps it's kind of a counterweight against our more selfish tendencies. I just wish we could do something like it without made up stories and bigotry. Secular humanist Unitarians might be about the best we can do right now in my opinion... I'm hoping that group continues to grow (they have been in recent years).

1 more reply

j / k navigate · click thread line to collapse

0 comments

anon3738392y ago

> We’re moving toward a world where every job will be modeled

jdietrich2y ago

It's glib to dismiss safety concerns because we haven't all turned into paperclips yet. LLMs and image gen models are having real effects now.

autoexec2y ago

verticalscaler2y ago

Alternatively while it may be difficult to trick you directly, phishing the passphrase from a more naive loved one or bored coworker and then parroting it back to you is also a possibility. 'etc.

Phone scams are no joke and this is getting past the point where regular people can be expected to easily filter them out.

withinboredom2y ago

Or just ask them to tell them something only you both know (a story from childhood, etc). Reminds me of a book where this sort of thing was common (don't remember the title):

1. something you have

2. something you know

3. something you are

These three things are required for any authz.

2 more replies

cbm-vic-202y ago

"Hey Janelle, what's wrong with Wolfie?"

2 more replies

hirako20002y ago

People are and have always been screwed over by modestly equiped humans.

1 more reply

s3p2y ago

"Hey mom and dad, we need a memorable phrase so AI bots can't call us and pretend to be each other."

fauigerzigerk2y ago

I think humankind has managed massive shifts in what and who you could trust several times before.

We went from living in villages where everyone knew each other to living in big cities where almost everyone is a stranger.

We went from photos being relatively reliable evidence to digital photography where anyone can fake almost anything and even the line between faking and improving is blurred.

We went from mass distribution of media being a massive capital expenditure that only big publishers could afford to something that is free and anonymous for everyone.

We went from a tiny number of people in close proximity being able to initiate a conversation with us to being reachable for everyone who could dial a phone number or send an email message.

Each of these transitions caused big problems. None of these problems have ever been completely solved. But each time we found mitigations that limit the impact of any misuse.

I see the current AI wave as yet another step away from trusting superficial appearances to a world that requires more formal authentication protocols.

ant6n2y ago

During these boundaries people can die. Consider the advent of yellow journalism and the connection with the Spanish-American war 1898: https://en.m.wikipedia.org/wiki/American_propaganda_of_the_S...

1 more reply

b1122y ago

Outside of the transition to a large city, virtually everything you've mentioned happened in the last 1/2 century. Even the phone was expensive, and not widely in use in under 100 years ago.

That's massive fast change, and we haven't culturally caught up to any of it yet.

2 more replies

insane_dreamer2y ago

indigochill2y ago

2 more replies

golemotron2y ago

> Each of these transitions caused big problems. None of these problems have ever been completely solved. But each time we found mitigations that limit the impact of any misuse.

There's an old (not quite joke) that if civilization fell, a large percentage of the population would die of the effects of tooth decay.

fullstackchris2y ago

jdietrich2y ago

>to me, this is nothing crazy (yet)

The whole LLM thing might be a nothingburger, but how much are we willing to gamble on that outcome?

1 more reply

t4ng0pwn3d2y ago

If you get off the internet you'd not even realise these tools exists though. And for the statement that all jobs will be modelled to be true, it'd have to be impacting the real world.

ben_w2y ago

Is it even possible to "get off the internet" without also leaving civilisation in general at this point?

> it'd have to be impacting the real world

Or does it only count when it's guiding a robot that's not merely a tech demo?

1 more reply

Cyphase2y ago

If you get away from roads you wouldn't realize engines exist. Also, the internet is (part of) the real world.

1 more reply

knallfrosch2y ago

Capabilities aren't the problem, cultural adoption is. Just yesterday I talked to someone who still googles solutions to their Excel table woes. Didn't they know of Copilot?

Maybe they didn't know, maybe none of their colleagues used it, their company didn't pay for it, or maybe all they need is an Excel update.

But I am confident that using Copilot would be faster than clicking through the sludge that are Microsoft Office help pages (third party or not.)

2 more replies

lobochrome2y ago

HN comments, too. Long, grammatically perfect comments that sound hollow and a bit lengthy are everywhere now.

It's still early, and I don't see much in corporate communications, for instance, but it will be quite the change.

dri_ft2y ago

>Long, grammatically perfect comments that sound hollow and a bit lengthy

It's worse than I thought. They've already managed to mimick the median HN user perfectly!

myspy2y ago

No problem, I'm here to keep the language sophistication level low.

1 more reply

ff102y ago

I tried to make ChatGPT generate a counterpoint for that but it turns out you're right.

schmorptron2y ago

I fear that at some point the anonymity that made the internet great in the first place will be destroyed by this.

2 more replies

neves2y ago

I'm a non native English speaker. Edge new feature of automatically improving my text is a God send. Unfortunately it is blocked at work.

vmfunction2y ago

Many business doesn't want to send their data to a third party such as OpenAI, so until locally run LLM becomes wildly available in businesses.

red-iron-pine2y ago

as the meme goes "always has been"

i remember seeing the change when GPT-2 was announced

fsloth2y ago

I guess we need to have an AI secretary to take in all phonecalls from now on (spam folder will become a lot more interesting with celebrity phone calls, your dead relative phoning you etc)

smokel2y ago

5 more replies

PeterisP2y ago

1 more reply

visarga2y ago

> I guess we need to have an AI secretary to take in all phonecalls

Why not an AI assistant in the browser to fend all the adversarial manipulation and spam AIs on the web? Going online without your AI assistant would be like venturing without a mask during COVID

I foresee a cat-and-mouse game, AIs for manipulation vs AIs for protection one upping each other. It will be like immune system vs viruses.

sethammons2y ago

3 more replies

red-iron-pine2y ago

> Social media is absolutely lousy with AI-powered fraud of varying degrees of sophistication.

has been for years mon ami. i remember when they started talking about GPT-2 here, and then seeing a sea-change in places like reddit and quora

quite visible on HN, esp. in certain threads like those involving brands that market heavily, or discussions of particular countries and politics.

infinitezest2y ago

People were already killing each other for thousands of years so introducing tanks was no big deal, I guess. To say nothing of nuclear weapons.

idle_zealot2y ago

jdietrich2y ago

>What does abating that trend look like?

I don't think anyone has a good answer to that question, which is the problem in a nutshell. Job one is to start investing seriously in finding possible answers.

>We need to roll back to "don't trust anything online, don't share your identity or payment information online"

2 more replies

pixl972y ago

Trust is more complex then we take credit for.

selfmodruntime2y ago

Point a) is just point b) in disguise. You're just swapping companies for governments.

emporas2y ago

>Regardless of any future impacts on the labour market or any hypothesised X-risks

>We're already at a point where we're counselling elders to ignore late-night messages from people claiming to be a relative in need of an urgent wire transfer.

myrmidon2y ago

> Discovering an asteroid full of gold, with as much gold as half the earth to put a modest number, would have huge impact

All this would do is crash the gold price. Also note that all the gold at our disposal right now (worldwide) basically fits into a cube with 20m edges (its not as much as you might think).

2 more replies

om82y ago

Another thing is that political actors are tending to try to concentrate power in their own hands. No way they will delegate a decision making to any form of algorithm — being cryptographic or not.

1 more reply

horns4lyfe2y ago

A lot of these are non-AI problems. People trying to defraud the elderly need to be taken out back and shot, that’s not an AI issue.

pixl972y ago

visarga2y ago

> I'm not confident that I'd be able to distinguish GPT 4o from a human speaker

Probably why it's not released yet. It's unsafe for phishing.

2OEH8eoCRo02y ago

I think people are dismissive for a few reasons.

- It helps them sleep at night if their creation doesn't put millions of people out of work.

- Fear of regulation

miohtama2y ago

> What defences do we have when an LLM will be able to have a completely fluent, natural-sounding conversation in someone else's voice?

The world learnt to deal with Nigerian Prince emails and nobody is falling to those anymore. Nothing was changed - no new laws or regulations needed.

Phishing calls have been going on without an AI for decades.

You can be skeptical and call back. If you know your friends or family you should be able to find an alternative way to get in touch always without too much effort in the modern connected world.

Just recently a gang in Spain was arrested for "son in trouble" scam. No AI used. Most of the parents are not fooled in this.

https://www.bbc.com/news/world-europe-68931214

The AI might have some marginal impact, but it does not matter in the big picture of scams. While it is worrisome, it is not a true safety concern.

fhe2y ago

> yet the real-world impacts remain modest so far.

LLMs is doing nothing of the sort for me.

sethammons2y ago

Google was a step function, a complete leveling up in terms of usability of returned data.

ChatGPT does this again for me. I am routinely getting zero useful results on the first page or two of Google searches, but AI is answering or giving me guidance quickly.

Maybe this would not seem such an improvement if Google's results were like they were 10 years ago and not barely usable blogspam

short_sells_poo2y ago

> I am routinely getting zero useful results on the first page or two of Google searches, but AI is answering or giving me guidance quickly.

For what it's worth, I agree with you that Google Search has become unusable. Google basically destroyed it's best product (for users), by turning it into an ad riddles shovelware cesspit.

xanderlewis2y ago

What are you searching for? I see people complaining about this a lot but they never give examples. Google is chock full of spam, yes, but it still works for me.

iamacyborg2y ago

Google’s results are themselves an AI product though. You’re just comparing different AIs.

palad1n2y ago

wenc2y ago

I remember Google being marginally better than Altavista but not much more.

The cool kids in those days used Metacrawler, which meta searched all the search engines.

6 more replies

Vinnl2y ago

That's not to say it won't have more significant impact in the future; I wouldn't know. But so far, I've yet to see the hype get realised.

RhodesianHunter2y ago

>LLMs is doing nothing of the sort for me.

Don't use it for things you're already an expert in, it can't compare to you yet.

Use it for learning new things, or for things you aren't very good at and don't want to bother with. For these it's incredible.

XCSme2y ago

For me, LLMs mostly replaced search. I run local Ollama, and whenever I need help with coding/docs/examples, I just ask Mixtral7x8B, and get an answer instantly, tailored to my needs.

ben_w2y ago

> OpenAI are masters of hype. They have been generating hype for years now, yet the real-world impacts remain modest so far.

Perhaps.

The statement was rather more prosaic and less surprising; are you sure it's OpenAI (rather than say all the AI fans and the press) who are hyping?

…

anon3738392y ago

ben_w2y ago

> (And to further what is now a pretty transparent effort to monopolize the tech through regulatory intervention.)

lynx232y ago

> yet the real-world impacts remain modest so far.

> Yet we now have Llama 3 in the wild

Yes, great, THANKS Meta, now the Scammers have something to work with. Thats a wonderful achievement which should be praised! </sarcasm>

anon3738392y ago

> I got a clear description of each and every piece of clothing, including print, colours and brand. I am blind BTW.

That is a really great application of this tech. And definitely qualifies as real-world impact. Thanks for sharing that!

keiferski2y ago

I can’t even get GPT 4 to reliably take a list of data and put it in a CSV. It gets a problem every single time.

mFixman2y ago

GPT-4 is better at planning than at executing.

Have you tried asking it to generate a regex to transform your list into a CSV?

Etherlord872y ago

I remember when people used to argue about regex being bad or good, with a lot of low quality regex introducing bugs in codebases.

Now we have devs asking AI to generate regex formulas and pasting it into code without much concern on its validity.

1 more reply

keiferski2y ago

No, I’ll give that a shot. I have just been asking it to convert output into a CSV, which used to work somewhat well. It stumbles when there is more complexity though.

1 more reply

jstummbillig2y ago

> Do you remember when they teased GPT-2 as "too dangerous" for public access? I do.

pixl972y ago

I think the problem is, we did drown in a flood of bullshit, but we've just somehow missed it.

kenjackson2y ago

GPT-4 already seems better at reasoning than most people. It just has an unusual training domain of Internet text.

havercosine2y ago

Job seekers currently in college have no idea what is about to hit them in 3-5 years.

selfmodruntime2y ago

In any other industry where just need an average margin of error close to a human's work and verification is much easier than generating possible outputs, the market will change drastically.

1 more reply

jahnu2y ago

I’d love to see this! Can you give us a couple of concrete examples of this that we can check?

golol2y ago

not really. Even a human bad at reasoning can take 1 hour of time to tinker around and figure things out. GPT-4 just does not have the deep planning/reasoning ability necessary for that.

theshrike792y ago

Have you seen some people with technology? =)

They won't "take 1 hour of time", they try it once or twice and give up.

lynx232y ago

trashtester2y ago

__MatrixMan__2y ago

Does it need those things if it can just tap into artifacts generated by humans who did spend that hour?

1 more reply

bamboozled2y ago

If everyone is average at reasoning then it must not be a very important trait or we’d all be at reasoning school getting better at it.

Really philosophy seems to be one of the least important subjects right now. Hardly anyone learns about it in school.

If it was so important to success in the wild than it would stand to reason we all work hard at improving our reasoning skills, but very few do.

ben_w2y ago

These did not provide useful life-lessons for me.

(The philosophy A-level I did voluntarily seemed to be 50% "can you find the flaws in this supposed proof of the existence of god?")

2 more replies

owenpalmer2y ago

They're masters of hype because their products speak for themselves (literally)

famouswaffles2y ago

Even now, they're shipping text-image 4o but not the new voice while leaving old-voice up and confusing/disappointing a whole lot of people. This is a pretty big marketing blunder.

fullstackchris2y ago

> ChatGPT took off on Word of Mouth alone.

6 more replies

danielscrubs2y ago

kortilla2y ago

jdietrich2y ago

1 more reply

monk_e_boy2y ago

colleges are seeing apprentices placements drop - why train an apprentice for two years when ChatGPT will do the work for them?

1 more reply

mewpmewp22y ago

I think mostly claims have been around multiplying the efforts of people for now.

Sabinus2y ago

If Google hadn't ruined Search to help Advertising perhaps it wouldn't have been such a stark comparison in information quality.

d1sxeyes2y ago

Search was always a byproduct of Advertising. Don’t blame Google for sticking to their business model.

We were naive to think we could have nice things for free.

2 more replies

tux19682y ago

It will be interesting to see how they compare, years from now, when ChatGPT has been similarly compromised.

1 more reply

selfmodruntime2y ago

There is little other way of making money from search.

anon3738392y ago

I believe you, and I do turn to an LLM over Google for some queries where I'm not concerned about hallucination. (I use Llama 3 most of the time, because the privacy is absolute.)

danielscrubs2y ago

But ChatGPT has really hurt Google's brand image.

osigurdson2y ago

Ironically, I was like that for a while, but now use regular google search again quite a bit. A lot of times, good old stack overflow is best.

bobviolier2y ago

The questions I ask ChatGPT have (almost) no monetary value for Google (programming, math, etc).

The questions I still ask Google, have a lot of monetary value (restaurants, cloths, movie, etc).

galaxyLogic2y ago

I use Google and it gives me AI answers.

But I agree seems SO often helps more than Google-AI.

doug_durham2y ago

collyw2y ago

Chat GPT 3.5 has been neutered, as it it won't spit out anything that isn't overly politically correct. 4chan were hacking their way around it. Maybe that's why they decided it was "too dangerous".

FrustratedMonky2y ago

" GPT-4o doesn't show much of an improvement in "reasoning" strength."

Maybe that is GPT-5.

And this release really is just incremental improvements in speed, and tying together a few different existing features.

koonsolo2y ago

> yet the real-world impacts remain modest so far

Go ask any teacher or graphician.

denvrede2y ago

That's one of my biggest fears, teachers using AI generated content without "checks" to raise / teach / test our children.

KronisLV2y ago

> Do you remember when they teased GPT-2 as "too dangerous" for public access? I do.

Maybe not GPT-2, but in general LLMs and other generative AI types aren't without their downsides.

It's kind of at a point where I use LLMs for dev work not to fall behind, cause the productivity gains for simple problems and boilerplate are hard to argue with.

naasking2y ago

> They have been generating hype for years now, yet the real-world impacts remain modest so far.

I feel like everyone who makes this claim doesn't actually have any data to backup it up.

somenameforme2y ago

[1] - https://en.wikipedia.org/wiki/Sigmoid_function

mlsu2y ago

achierius2y ago

abenga2y ago

AGI being "just an engineering challenge" implies that it is conceptually solved, and we need only figure out how to build it economically.

It most definitely is not.

saberience2y ago

Waymo cars are highly geofenced in areas with good weather and good quality roads. They only just (in January) gained the capability to drive on freeways.

Let me know when you can get a Waymo to drive you from New York to Montreal in winter.

naasking2y ago

> Waymo cars are highly geofenced in areas with good weather and good quality roads. They only just (in January) gained the capability to drive on freeways

They are an existence proof that the original claim that we seem further than ever before is just wrong.

2 more replies

ux-app2y ago

Why do some people gloat about moving goalposts around?

15 years ago self driving of any sort was pure fantasy, yet here we are.

They'll release a version that can drive in poor weather and you'll complain that it can't drive in a tornado.

2 more replies

rossant2y ago

It's been 8 years and I still don't have my autonomous car.

Meanwhile I've been using ChatGPT at work for _more than a year_ and it's been tremendously helpful to me.

This is not hype, this is not about how AI will change our lives in the future. It's there right here, right now.

somenameforme2y ago

davedx2y ago

> So long as LLMs regularly hallucinate, they're not going to be useful for much other than tasks that can accept relatively high rates of failure.

Yep. So basically they're useful for a vast, immense range of tasks today.

tcgv2y ago

> I've been working on a system to extract certain financial "facts" across SEC filings. ChatGPT has not been helpful at all

jstummbillig2y ago

> So long as LLMs regularly hallucinate, they're not going to be useful for much other than tasks that can accept relatively high rates of failure.

My sense is that's not an aspect of LLMs we should have any trouble with incorporating smoothly, just by adhering to the safety nets that we built in response to our own deficiencies.

SubiculumCode2y ago

nox1012y ago

I think it's more like an exponential curve where it looks flat moments before it shoots up.

mapping th genome was that way. On a 20yr schedule, barely any progress for 15 and then poof, done ahead of schedule

whilenot-dev2y ago

> or just.. training data.

Aeolun2y ago

aussieguy12342y ago

My new PC arrives tomorrow. Once I source myself two RTX 3060's I'll be an AI owner, no longer dependant on cloud APIs.

trashtester2y ago

GPT 4o incorporated multimodality directly in the neural network, while reducing inference costs to half.

I fully expect GPT 5 (or at the latest 6) to similarly have native inclusion of agentic capabilities either this year or next year, assuming it doesn't already, but is just kept from the public.

bamboozled2y ago

Going to put the economy in a very, very weird situation if true.

Will be like, the end of millions of careers overnight.

It will probably strongly favour places like China and Russia though, where the economy is already strongly reliant on central control.

trashtester2y ago

> It will probably strongly favour places like China and Russia though, where the economy is already strongly reliant on central control.

I think you may be literally right in the opposite sense to what I think you intended.

China (and maybe Russia) may be able to use central control to have an advantage when it comes to avoiding disasterous outcomes.

ilaksh2y ago

Agentic capability just means it outputs a function call which it has had for a long time.

trashtester2y ago

That's a very weak form. The way I use "agentic" is that it is trained to optimize the success of an agent, not just predict the next token.

selfhoster112y ago

You are far better off investing in one or more 3090s and loading up on DDR RAM.

sambazi2y ago

> Agents so far need a human in the loop to keep them sane.

not quite sure that sanity is a business requirement

aussieguy12342y ago

Yes, but to use a car dealership example, you don't want your Agent to sell a car to someone for $1 https://hothardware.com/news/car-dealerships-chatgpt-goes-aw...

mottiden2y ago

> We’re moving toward a world where every job will be modeled, and you’ll either be an AI owner, a model architect, an agent/hardware engineer, a technician, or just.. training data.

I think we are going towards a more collaborative era where computers and humans interact much more. Everything will be a remix :)

altcognito2y ago

> The likelihood of the world where the only job are model architects, engineers or technicians is very very small.

Oh, especially since it will be a priority to automate their jobs, or somehow optimize them with an algorithm because that's a self-reinforcing improvement scheme that would give you a huge edge.

SubiculumCode2y ago

Every corporate workplace is already thinking: How can I surveil and record everything an employee does as training data for their replacement in 3 years time.

mycall2y ago

> Can it do math yet?

GPT-4? Not that well. AI? Definitely

https://deepmind.google/discover/blog/alphageometry-an-olymp...

schnitzelstoat2y ago

Until the hallucination problem is solved, the output can't be trusted.

So outside of use-cases where the user can quickly verify the result (like picking a decent generated image etc.),I can't see it being used much.

zarathustreal2y ago

Never heard of retrieval-augmented generation?

usrbinbash2y ago

RAG? Sure. I even implemented systems using it, and enabling it, myself.

zarathustreal2y ago

Are you implying that you’re the same person I was commenting to or are you just throwing your opinion into the mix?

If you’re placing the bar at 100% hallucination-free accuracy then I’ve got some bad news to tell you about the accuracy of the floating point operations we run the world on

visarga2y ago

> Can it just do my entire job for me?

p1esk2y ago

This is still gpt4. I don’t expect much more from this version than what previous version could do, in terms of reasoning abilities.

But everyone is expecting them to release gpt5 later this year, and it is a bit scary to think what it will be able to do.

trashtester2y ago

It's quite different from gpt4 in two respects:

1) It's natively multi-modal in a way I don't think gpt4 was.

2) It's at least twice as efficient in terms of compute. Maybe 3 times more efficient, considering the increase in performance.

jiggawatts2y ago

Branding aside, this pretty much is GPT 5.

They could have called it GPT-5 and everyone would have believed them.

p1esk2y ago

The expectations for gpt5 are sky high. I think we will see a similar jump as 3.5 -> 4.

mewpmewp22y ago

Pretty sure they said they would not release GPT-5 on Monday. So it's something else still. And I don't see any sort of jump big enough to label it as 5.

I assume GPT-5 has to be a heavier, more expensive and slower model initially.

GPT-4o is like an optimisation of GPT-4.

adroniser2y ago

That doesn't imply that it's GPT-5. A GPT-4 training run probably doesn't take them that long now they've acquired so many GPUs for training GPT-5.

golol2y ago

I think 4o is actually noticeably smarter than 4, after having tried it a tiny bit on the playground.

jimmySixDOF2y ago

hackerlight2y ago

It's a completely new model trained from scratch that they've decided to label that way as part of their marketing strategy.

datahack2y ago

All I could think about when watching this demo was how similar capabilities will work on the battlefield. Coordinated AIs look like they will be obscenely effective.

Everything always starts as a toy.

trashtester2y ago

The "Killer app" for AGI/ASI is, I suspect, going to be in robotics, even more so than in replacing "white collar workers".

That includes, beyond literal Killers, all kinds of manufacturing, construction and service work.

I would expect a LOT of funds to go into research all sorts of actuators, artificial muscles and any other technology that will be useful in building better robots.

Companies that can get and maintain a lead in such technologies may reach a position similar to what US Steel had in the 19th century.

That could be the next nvidia.

Who will ultimately control them, though?

bamboozled2y ago

I would expect a LOT of funds to go into research all sorts of actuators, artificial muscles and any other technology that will be useful in building better robots.

If you had an ASI? I don’t think you’d need a lot of funds to go into this area anymore ? Presumably it would all be solved overnight.

trashtester2y ago

Once we have godlike tier ASI, you're probably right. But I expect that robots could become extremely lucrative even when avaiable AI's haven't reached that point yet.

EDIT: Also, once we really have the kind of Godlike ASI you envision, no human actions really matter (economically) anymore.

RugnirViking2y ago

trashtester2y ago

Yes. As you say, a lot of the limitations so far has been the control part, which is basically AI.

Given the pace that AI is currently moving at, it seems to me that more and more, the mechanical aspect is becoming the limitation.

This leaves the direct low-level actuator control to execute such tasks in detail. But even there, development has been immense. See for instance these soccer playing robots [1]

There is one more limitation, of course, which is that GPT 4o still requires a constant connection to a data center, and that the models is too large to run within a device or machine.

This is also one of the most critical limitations of self driving. Had the AI within a Tesla had the same amount of compute available as GPT-4o, it should be massively more capable.

[1] https://www.youtube.com/watch?v=RbyQcCT6890

hwbunny2y ago

And people will become utterly stupid in the process.

CTDOCodebases2y ago

or just... unemployed.

megous2y ago

Why so much positivity? It can also murder people, and it will continue being used for that. That's scary.

onion2k2y ago

This is no different to saying a person with a gun murdered someone rather than attributing the murder to the gun. An AI gun is just a really fancy gun.

7bit2y ago

There may come a time where we grow so accustomed to this, that the decision is so heavily influenced by AI, that we believe it more than human decisions.

And then it can very well kill a human through misdiagnostic.

I think it is important to not just put this thought aside, but to evaluate all risks.

apostata2y ago

> And then it can very well kill a home through misdiagnosis.

If we didn't give it that goal, how does it obtain it otherwise?

bloqs2y ago

A prompt is a _very_ different matter.

Cthulhu_2y ago

latexr2y ago

> This is no different to saying a person with a gun murdered someone rather than attributing the murder to the gun.

It’s also worth nothing the exact words of the person you’re responding to (emphasis mine):

> It can also murder people, and it will continue being used for that.

Being used. As in, they’re not saying that AI kills on its own, but that it’s used for it. Presumably by people. Which doesn’t contradict your point.

¹ https://en.wikipedia.org/wiki/Guns_don%27t_kill_people,_peop...

spurgu2y ago

1 more reply

ithkuil2y ago

Probably it will be used for many things. Just like computers, electricity, iron

mavhc2y ago

Don't need AI to murder people, a gun with an actuator on the trigger can murder people easily, add rnd to it and it can murder people at random

interludead2y ago

Anything (almost) can be used for good and bad

kolinko2y ago

So can knives

Cthulhu_2y ago

7bit2y ago

Cthulhu_2y ago

The point is not the tool but how it's used. "What knives are allowed" is a moot point because a butter knife or letter opener can be used to kill someone.

1 more reply

aredox2y ago

Call me back when we have autonomous driving, and when Bitcoin will replace currencies.

What's scary and cringey are your delusions.

plaidfujiOP2y ago

aksss2y ago

Or a battery.

Jiahang2y ago

make people smart

starfezzy2y ago

Nature had been doing that for billions of years until a few decades ago when we were told "progress" meant we had to stop doing the same thing more peacefully and intentionally.

My guess is the future belongs to those who don't stop—who, in fact, embrace the opposite of stopping.

I would even suggest that the present belongs to those who didn't stop. It may be too late for normal people to ever catch up by the time we realize the trick that was played on us.

jddj2y ago

The present absolutely belongs to those who didn't stop, but it's been a lot longer than a few decades.

Varying degrees of greedy / restless / hungry / thirsty / lustful are what we've got, because how is contentedness ever going to compete with that over millennia?

slfnflctd2y ago

It just occurred to me that this is one of the core things most successful religions have been trying to do in some form from the time they first arose.

1 more reply

j / k navigate · click thread line to collapse