As a professional...why not do this? There's a non-zero chance that it'll find something fairly basic that you missed and the cost is several cents. Even if it just phrases something obvious in a way that makes you think, it's well worth the effort for a multimillion dollar client.
If they further increase the context window, this thing becomes a Second Opinion machine. For pretty much any high level job. If you can put in ALL of the information relevant to a problem and it can algorithmically do reasoning, it's essentially a consultant that works for pennies per hour. And some tasks that professionals do could be replaced altogether. Out of all the use cases for LLMs that I've seen so far, this seems to me to have the biggest potential impact on daily life.
edit (addition): What % of people can hold 25,000 words worth of information in their heads, while effectively reasoning with and manipulating it? I'm guessing maybe 10% at most, probably fewer. And they're probably the best in their fields. Now a computer has that ability. And anyone that has $20 for the OpenAI api can access it. This could get wild.
Because your clients do not allow you to share their data with third parties?
Imagine being able to ask your workplace server if it has noticed any unusual traffic, or to write a report on sales with nice graphs. It would be so useful.
we won’t have that until we come up with a better way to fund these things. “””Open””” AI was founded on that idea and had the best chance of anyone of reaching it. Even going in with that intent, they failed: they switched to locking down distribution of their models and somehow ended up effectively captured by MS despite the original non-profit-like structure. You just won’t see what you’re asking for as long as this field is dominated by the profit motive.
Tada! Literally runs on a raspberry pi (very slowly).
GPT models are incredible but the future is somehow even more amazing than that.
I suspect this will be the approach for legal / medical uses (if regulation allows).
But a model that can run on a private cluster is certainly something that there’s going to be demand for. And once that exists there’s no reason it couldn’t be run on site.
You can see why OpenAI doesn’t want to do it though. SaaS is more lucrative.
The matrix gets decoded into text on the client side in JavaScript, so we would only send to and receive from ChatGPT a vector of floats (obfuscation?)
See
- https://www.zama.ai/post/encrypted-image-filtering-using-hom...
- https://news.ycombinator.com/item?id=31933995
- https://news.ycombinator.com/item?id=34080882
So, LLaMA? It's no ChatGPT, but it can potentially serve this purpose.
https://azure.microsoft.com/en-us/products/cognitive-service...
(disclaimer: I work for Microsoft but not on the Azure team)
OpenAI just has to promise they won't store the data. Perhaps they'll add a privacy premium for the extra effort, but so what?
I would never send unencrypted PII to such an API, regardless of their privacy policy.
Email is harder, but I do run my own email server. For mostly network related reasons, it is easier to run it as a cloud VM, but there's nothing about the email protocol itself that needs you to use a centralised service or host it in a particular network location.
OpenAI just simply does not offer the same thing at this time. You’re stuck using Facebook’s model for the moment which is much inferior.
OTOH, the more patient info you are putting in, the less likely it is actually legally deidentified.
In my career, I saw a similar pattern with data warehouse users. Initially, managers asked junior analysts to write SQL. Later, the tools improved, and more technical managers could use a giant pivot table. Underneath, the effective query produced by the pivot table is way more complex than their previous SQL queries. Again, their jobs will change when on-site GPT becomes possible, so GPT can navigate their data warehouse.
It is 2023 now, and GPT-3 was already pretty good. GPT-4 will probably blow it away. What will it look like in 2030? It is terrifying to me. I think the whole internet will be full of GPT-generated ad copy that no one can distinguish from human-written material. There are a huge number of people employed as ad-copy writers on these crap ad-driven websites. What will their future work be?
You can look at an LLM trained on Othello moves, and extract from its internal state the current state of the board after each move you tell it. In other words, an LLM trained on only moves, like "E3, D3,.." contains within it a model of a 8x8 board grid and the current state of each square.
It's all the same classic personification of LLMs. What an LLM can show is not the same as what it can do.
The model was already present: in the example game moves. The LLM modeled what it was given, and it was given none other than a valid series of Othello game states.
Here's the problem with personification: A person who has modeled the game of Othello can use that model to strategize. An LLM cannot.
An LLM can only take the whole model and repeat its parts with the most familiar patterns. It is stuck fuzzing around the strategies (or sections of strategy) it has been given. It cannot invent a new divergent strategy, even if the game rules require it to. It cannot choose the winning strategy unless that behavior is what was already recorded in the training corpus.
An LLM does not play games, it plays plays.
In practice what essentially happened is that the super-high-quality Othello data had a huge impact on the parameters of GPT (since it was the last training data it received) and that impact manifested itself as those parameters overfitting to the rules of Othello.
The real test that I would be curious to see is if Othello GPT works when the logic of the rules are the same but the dimensions are different (e.g., smaller or larger boards).
My guess is that the findings would fall apart if asked about tile "N13".
> https://en.m.wikipedia.org/wiki/Reversi
> Dark must place a piece (dark-side-up) on the board and so that there exists at least one straight (horizontal, vertical, or diagonal) occupied line between the new piece and another dark piece, with one or more contiguous light pieces between them
Yeah.
"Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers"
https://arxiv.org/abs/2212.10559
@dang there's something weird about this URL in HN. It has 35 points but no discussion (I guess because the original submission is too old and never got any traction or something)
This is a common misunderstanding. Transformers are actually Turing complete:
* On the Turing Completeness of Modern Neural Network Architectures, https://arxiv.org/abs/1901.03429
* On the Computational Power of Transformers and its Implications in Sequence Modeling, https://arxiv.org/abs/2006.09286
What AGI claims? The article, and the comment you’re responding to don’t say anything about AGI.
That already feels closer to short-term memory.
Which raises the question: how far are we?
I get what you're saying and appreciate the 'second opinion machine' angle you're taking, but what's going to happen is very similar to what's happened with Stable Diffusion: certain things become extremely devalued and the rest of us learn to check the hands in the image to see if anything really wonky is going on.
For the GPT class of AI tech, the parallel seems to be 'see if it's outright making anything up'. GPT-4 is going to be incredibly vulnerable to Mandela Effect issues. Your ideal use-case is going to be 'give me the vox populi take on something', where you can play into that.
The future is not so much this AI, as techniques to doctor and subvert this type of AI to your wishes. Google-bombing, but for GPT. Make the AI be very certain of things to your specifications. That's the future. The AI is only the stage upon which this strategy is played out.
You mean word-for-word in your head? That's pretty impressive. Are you using any special technique?
So sure, still cheap for a doctor appointment, but not pennies. Do it 30 times per hour and you could've just hired a consultant instead.
Does it reason as well with 32k tokens as with 1k tokens? Like you said, humans find it difficult to really comprehend large amounts of content. Who says this machine isn't similarly limited? Just because you can feed it the 32k simultaneously doesn't mean it will actually be used effectively.
A halving of the costs every year or so seems realistic in this emerging phase.
ChatGPT could in theory have the knowledge of everything ever written, while your consultant can't.
They already allow their AI as a service without data collection, check their TOS.
Almost every answer in the thread was "this guy isn't that smart, this is obvious, everybody knew that", even though comments like the above are commonplace.
FWIW I agree with the "no competitive moat" perspective. OpenAI even released open-source benchmarks, and is collecting open-source prompts. There are efforts like Open-Assistant to create independent open-source prompt databases. Competitors will catch up in a matter of years.
To be fair it is easy to radically underestimate the rate of progress in this space. Last Wednesday I conservatively opined to a friend "in 10 years we'll all be running these things on our phones". Given that LLaMA was running on a phone a few days later, I may have been a little underoptimistic...
But I think you should forget about self-hosting at this point, the game is up.
That sounds scary, but what do you mean by "power"? Honest question, I'm fascinated by the discussion about learning, intelligence, reasoning, and so on that has been spawned by the success of GPT.
What "power" do you imagine being wielded? Do you think that power is any more dangerous in "private hands" than the alternatives such as government hands?
OpenAI doesn't have some secret technical knowledge either. All of these models are just based on transformers
Why wouldn't they? If someone is willing to pay for the privilege of using it.
They also just tweeted this to showcase how it can work with multimodal data too: https://twitter.com/gpt_index/status/1635668512822956032?s=4...
Everyone forgets basic UI research. "Ironies of Automation", Bainbridge, 1983. The classic work in the space.
Humans cannot use tools like this without horrible accidents happening. Give humans a tool that mostly works at spotting obvious problems, and they start to rely on it. Then they become complacent. And then the tool misses something, the human misses it too, and that's how disasters happen.
HIPAA violation https://www.hhs.gov/hipaa/for-individuals/index.html
>a lawyer an entire case history, etc.
lawyer client confidentiality violation https://criminal-lawyers.ca/2009/07/31/the-lawyers-duty-of-c...
Unless GPT-4 is running locally on our own computers, there's absolutely no way dumping a patient's entire medical history into this thing could possibly be considered ethical or legal.
Emphasis mine, but isn’t this a rather extreme view to be taking? Ethics deals in the edge cases, after all, so we can easily imagine a scenario where patient consent is obtained and the extra computational analysis provides life-saving insight.
Conversely, the output could mislead the doctor sufficiently to cost the patient their life, so I’m not making any absolute statements either ;)
For the record, and pedantry aside, I do agree with your overall point. Dropping patient history into this thing is incredibly ill-advised. The fact OpenAI retains all your input, including to the API, and provides no low-cost options for privacy is one of the biggest hurdles to major innovation and industry adoption.
In the US, the HIPAA Privacy Rule operates independently from the HIPAA Security Rule, for good reason. On their own, patients can do anything they want with their own data. But in the context of medical care, patients can't consent to having their personal health data processed in insecure systems. It is the same ethical reason that employees can't waive their rights to OSHA safety rules or why you can't consent to sell yourself as a slave. If you could waive security rules, then every doctor would include a waiver in their intake forms, and it's a race to the bottom. So unless OpenAI has a HIPAA-compliant data security infrastructure, it's illegal and unethical.
> Dropping patient history into this thing is incredibly ill-advised.
It's illegal
There is absolutely no way that feeding private medical data patients reveal to doctors in confidence to what's essentially the surveillance capitalism industry could possibly be considered ethical. Absolutely no way.
It hasn't even been a week since some medtech got caught selling out data to advertisers. Let us not doubt even for one second that this is unethical and illegal, or even speculate about possible scenarios where it might not be. These corporations do not deserve the benefit of the doubt.
Sir, would you be OK with sending all your medical records to the US to be potentially mined for profit by a for-profit, amoral organization like Microsoft? It may help, although third parties like the NSA will eventually access them. No thank you. What about your litigation papers at court? Hell no. Just do the job that I pay you to do, doctor/lawyer.
I just asked GPT-3 a research question that took me hours of searching back in the day and it returned the single seminal case for that topic immediately. As long as the lawyers then actually read the case and make sure it's right, I don't see why they can't use it.
It's true that most humans cannot do this, but loading words and contexts into your working memory is not the same as intelligence. LLMs excel at this kind of task, but an expert in a field such as medicine, isn't loading an entire medical report into their working memory and then making decisions or creating new ideas using that information. There are other unsolved aspects to our intelligence that are not captured by LLMs, that are still required to be an expert in some field, like medicine.
Still an incredible leap forward in AI technology, but I disagree with the implication that the best experts in a field are simply loading words from some text and reasoning with and manipulating it.
I'm not sure I can agree that humans cannot hold 25,000 words worth of information in their heads. For the average person, if they read 25,000 words, which can be done in a single sitting, they're not going to remember all of it, for sure, but they would get a lot out of it that they could effectively reason with and manipulate.
Not to mention that humans don't need to hold the entire report in their head because they can hold it in their hand and look at it.
And if anything, I think it's more significant to have a bigger working memory for GPT's own outputs than it is for the inputs. Humans often take time to reflect on issues, and we like to jot down our thoughts, particularly if it involves complex reasoning. Giving something long, careful thought allow us to reason much better.
This is 100% not the case. Eg I use a vector database of embedding to store an embedding of every video frame which I later use for matching.
There are many NLP-only related tasks this helps for but equally as many that still require lookup and retrieval.
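A minimal sketch of the pattern described above: store one embedding per video frame and match a query by nearest neighbour. A real system would use a vector database (FAISS, pgvector, etc.); plain NumPy cosine similarity stands in here, and the embeddings are random placeholders.

```python
import numpy as np

rng = np.random.default_rng(1)

# One embedding per video frame, normalized to unit length so that a
# dot product equals cosine similarity.
frame_embeddings = rng.normal(size=(1000, 512))
frame_embeddings /= np.linalg.norm(frame_embeddings, axis=1, keepdims=True)

def match(query: np.ndarray, k: int = 5) -> np.ndarray:
    """Return indices of the k frames most similar to the query embedding."""
    q = query / np.linalg.norm(query)
    scores = frame_embeddings @ q          # cosine similarity against all frames
    return np.argsort(scores)[::-1][:k]    # highest similarity first

best = match(frame_embeddings[42])         # querying with frame 42's own embedding
```

Querying with a stored frame's own embedding should return that frame first, which is a handy sanity check for this kind of retrieval index.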
EDIT: Another solution would be to store the session logs separately, and before each session use "fine-tuning training" to train it on your particular sessions; that could give it a "memory" as good as a typical therapist's memory.
I expect you could summarize the preceding, already generated story within that context, and then just prompt for the next chapter, until you reach a desired length. Just speculating here.
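The rolling-summary loop described above could be sketched like this. Note that `generate` and `summarize` are hypothetical stand-ins for real LLM calls (stubbed here so the control flow runs); only the loop structure is the point.

```python
def generate(prompt: str) -> str:
    # Stand-in for an LLM completion call (a real version would hit an API).
    return f"<chapter based on: {prompt[:40]}...>"

def summarize(text: str, max_words: int = 200) -> str:
    # Stand-in for an LLM summarization call; here it just truncates.
    return " ".join(text.split()[:max_words])

def write_story(premise: str, n_chapters: int) -> list[str]:
    chapters = []
    summary = premise
    for i in range(n_chapters):
        # Only the compressed summary goes into the prompt, keeping the
        # context window bounded no matter how long the story grows.
        prompt = f"Story so far: {summary}\nWrite chapter {i + 1}."
        chapter = generate(prompt)
        chapters.append(chapter)
        summary = summarize(summary + " " + chapter)
    return chapters

story = write_story("A detective wakes up with no memory.", 3)
```

The trade-off is that details dropped by the summary are gone for good, which is why longer context windows still matter.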
The one thing I truly cannot wait for is LLM's reaching the ability to generate (prose) books.
"I want <my favorite novel> rewritten in the style of <favorite author> but please focus more on <interesting theme>." I see so many possibilities. Passionate readers could become more like curators, sharing interesting prompts and creations.
Because someone mentioned Kafka: I'd like to know what Kafka's The Trial written in the style of a PKD novel would be like.
AI that's trained on their style could give me what I want.
GRRM fans also should probably think of the ways to feed ASOIF to the AI if they want to know how it ends.
Also I'd guess that it still generally sucks at programming. Code has a lot of very similar sequences and logical patterns that can be broken, which makes it prone to hallucinating. I'd imagine that more parameters will help with this.
because "open"AI logs everything that goes in and out of the model?
~50 pages is ... not the entire history of most cases.
> it can algorithmically do reasoning
How is this machine doing reasoning ?
Why do you think you can use this word in this context ?
When it has true reasoning and critical thought is when we will see it evolve into something else. For now, it seems we're still in the first generations of microprocessors with GPT-4: in the grand scheme of things they only have a few transistors, but we couldn't possibly comprehend where the technology will be merely a decade or two from now.
Is it perhaps having a consistent model that allows you to describe every point of it?
Because if ChatGPT can explain itself based on a model it builds, then it actually does reasoning.
Well but it can't, can it? That's the very point: it is an algorithm with no abstraction or reasoning capabilities. Mind, even without that it is still very useful for certain circumscribed purposes, but it's incorrect to say it can "reason".
--
Q: List possible abstractions encompassing all these specific concepts: singing, whistling, yodeling.
A: One possible abstraction that encompasses all these specific concepts is "vocalization." Vocalization refers to the act of producing sound using the vocal cords and the mouth. Singing, whistling, and yodeling are all different types of vocalization, but they share the common feature of using the human voice to create sound. Other possible abstractions that might encompass these concepts could include "musical expression," "auditory performance," or "sound production."
--
Source: My startup team (Preamble, Inc.) discovered the Prompt Injection attack category, which still affects all models including GPT-4.
There are many, many, many ways to hide prompt attacks in data that you might at first think you can trust but you really can’t.
As one of almost infinite examples: work with the mayor and townsfolk of a very small town to rename their town to the verbatim string you want to inject (in exchange for creating some jobs in their town).
Then all an attacker has to do is live in that town to inject the string. There are already all kinds of strange town names, like “Truth or Consequences” which is a real city in New Mexico.
Tl;dr: When it comes to HIPAA, risk aversion is the name of the game.
In the general case, for arbitrary input, I think the answer to this is clearly 0. At best we can compress the text into a limited embedding with a few salient points stored in long term memory.
The GPT string is closer to short term memory, and there 25k words is way more than a human is capable of.
But a human author can offload much storage to long term (or some intermediate) memory.
In principle, GPT should be able to do so too, by basically retraining the model with the text it just created added as input. That way, it might be able to write texts that are billions of words long, but at a much greater cost in computing power, since this would require one instance of the model per book being written.
If my lawyer or doctor put my case history into OpenAI and I would find out about it I would definitely sue them for breach of confidentiality.
In the case that the doctor isn't familiar with something accepted by science and the medical profession my experience is that they send you to another doctor that works with that particular drug or therapy. I've had this experience even with drugs that are generally accepted as safe.
Because of confidentiality.
First construct your correct analysis through conventional means, untainted by machine hallucinations. Then have the machine generate a result and see if it caught anything you missed, and carefully check whatever few parts you incorporate from it.
This is no different from having a lesser expert check your document (e.g. THE CLIENT!), except the machine's time is very close to free and it may even be better at catching far-off concepts.
I'm running whatever I can through this right now. It's doing what Google was doing, i.e. clues, but on steroids.
As soon as the length hits codebase size territory we're in yet greater frontiers.
You can send 25,000 random words in the prompt and ask GPT how many pairs of words share at least one letter. I doubt the answer will be correct...
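For comparison, that counting task is trivial for ordinary code, which is part of why it makes a good stress test for token-by-token generation. A short Python sketch of the exact computation:

```python
from itertools import combinations

words = ["cat", "dog", "bird", "emu"]

# Count unordered pairs of words whose letter sets intersect.
pairs = sum(
    1 for a, b in combinations(words, 2)
    if set(a) & set(b)  # share at least one letter
)
# Here only ("dog", "bird") share a letter ("d"), so pairs == 1.
```

At 25,000 words that's about 312 million pairs, still seconds of compute, but an LLM has to get every one of them right implicitly in a single pass.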
I would love to but openai’s privacy policies makes it a huge ethics, privacy, and security breach. I’m interested in running Facebook’s model just as a workaround to this fundamental issue.
you don't see a real problem there?
No one can trust the AI.
It's all statistical word generation aka math!
And this is not how humans "work": our brains are not computers running software. We are something else.