OpenAI shuts down its AI Classifier due to poor accuracy (opens in new tab)

(decrypt.co)

503 pointscbowal2y ago283 comments

283 comments

I'm glad that they did, although they should obviously done an announcement for it.

The amount of people in the ecosystem who thinks it's even possible to detect if something is AI written or not when it's just a couple of sentences is staggering high. And somehow, people in power seems to put their faith in some of these tools that guarantee a certain amount of truthfulness when in reality it's impossible they could guarantee that, and act on whatever these "AI vs Human-written" tool tell them to.

So hopefully this can serve as another example that it's simply not possible to detect if a bunch of characters were outputted by an LLM or not.

feoren2y ago

Indeed it's not possible. Say you had a classifier that detected whether a given text was AI generated or not. You can easily plug this classifier into the end of a generative network trying to fool it, and even backpropagate all the way from the yes/no output to the input layer of the generative network. Now you can easily generate text that fools that classifier.

So such a model is doomed from the start, unless its parameters are a closely-guarded secret (and never leaked). Then it means it's foolable by those with access and nobody else. Which means there's a huge incentive for adversaries to make their own, etc. etc. until it's just a big arms race.

It's clear the actual answer needs to be: we need better automated tools to detect quality content, whatever that might mean, whether written by a human or an AI. That would be a godsend. And if it turned into an arms race, the arms we're racing each other to build are just higher-quality content.

asddubs2y ago

The whole problem with AI is that it's able to copy some of the superficial indicators of quality content while feeding you lies. You cannot detect quality content without detecting truthfulness. Any heuristic you use in place of that can be copied without actually providing value (which is exactly what ChatGPT does now, when it gets things wrong)

1 more reply

tessierashpool2y ago

> You can easily plug this classifier into the end of a generative network trying to fool it, and even backpropagate all the way from the yes/no output to the input layer of the generative network. Now you can easily generate text that fools that classifier

could you contextualize your use of the word "easily" here?

I feel like "easily" might mean "with infinite funds and frictionless spherical developers."

1 more reply

sobellian2y ago

Even without a GAN, it's quite possible that one could simply write `while(rejects(output)) { output = gpt(prompt) }` and obtain a sufficient output after a reasonable number of iterations.

dreamcompiler2y ago

You just outlined an excellent proof of what might be called the AI Halting Problem.

constantcrying2y ago

Even the idea of it is bad, ChatGPT is supposed to write indistinguishably from a human.

The "detector" has extremely little information and the only somewhat reasonable criteria are things like style, where ChatGPT certainly has a particular, but by no means unique writing style. And as it gets better it will (by definition) be better at writing in more varied styles.

bko2y ago

Copying a comment I posted a while ago:

I listened to a podcast with Scott Aaronson that I'd highly recommend [0]. He's a theoretical computer scientist but he was recruited by OpenAI to work on AI safety. He has a very practical view on the matter and is focusing his efforts on leveraging the probabilistic nature of LLMs to provide a digital undetectable watermark. So it nudges certain words to be paired together slightly more than random and you can mathematically derive with some level of certainty whether an output or even a section of an output was generated by the LLM. It's really clever and apparently he has a working prototype in development.

Some work arounds he hasn't figured out yet is asking for an output in language X and then translating it into language Y. But those may still be eventually figured out.

I think watermarking would be a big step forward to practical AI safety and ideally this method would be adopted by all major LLMs.

That part starts around 1 hour 25 min in.

> Scott Aaronson: Exactly. In fact, we have a pseudorandom function that maps the N-gram to, let’s say, a real number from zero to one. Let’s say we call that real number ri for each possible choice i of the next token. And then let’s say that GPT has told us that the ith token should be chosen with probability pi.

https://axrp.net/episode/2023/04/11/episode-20-reform-ai-ali...

5 more replies

siglesias2y ago

I'd challenge this assumption. ChatGPT is supposed to convey information and answer questions in a manner that is intelligible to humans. It doesn't mean it should write indistinguishably from humans. It has a certain manner of prose that (to me) is distinctive and, for lack of a better descriptor, silkier, more anodyne, than most human writing. It should only attempt a distinct style if prompted to.

2 more replies

insanitybit2y ago

I tried an experiment when GPT4 allowed for browsing. I sent it my website and asked it to read my blog posts, then to write a new blog post in my writing style. It did an ok job. Not spectacular but it did pick up on a few things (I use a lot of -'s when I write).

The point being that it's already possible to change ChatGPT's tone significantly. Think of how many people have done "Write a poem but as if <blah famous person> wrote it". The idea that ChatGPT could be reliably detected is kind of silly. It's an interesting problem but not one I'd feel comfortable publishing a tool to solve.

toss12y ago

Yup.

Moreover, the way to deal with AI in this context is not like the way to deal with plagiarism; do not try to detect AI and punish its use.

Instead, assign it's use, and have the students critique the output and find the errors. This both builds skills in using a new technology, and more critically, builds the essential skills of vigilance for errors, and deeper understanding of the material — really helping students strengthen their BS detectors, a critical life skill.

1 more reply

Teever2y ago

Nitpick: ChatGPR is supposed to write in a way that is indistinguishable from a human, to another human.

That doesn't mean that it can't be distguishable by some other means.

3 more replies

WiSaGaN2y ago

The default style people cites about ChatGPT is also nothing intrinsic about AI, it is just this paticular AI is trained and prompted to output information in this way. The output style can change drastically with just a little prompt change even on the user side.

RandomLensman2y ago

Why even care if it is written by a machine or not? I am not sure it matters as much as people think.

3 more replies

catboybotnet2y ago

There's also the post going around about how it can (and does) falsely flag human posts as AI output, particularly among some autistic people. About as useful as a polygraph, no?

capableweb2y ago

Both false-positives are as useful as the other one, flagged "human" but actually "LLM" vs flagged "LLM" but actually "human". As long as no one put too much weight on the result, no harm would have been done, in either case. But clearly, people can't stay away from jumping to conclusions based on what a simple-but-incorrect tool says.

3 more replies

hef198982y ago

We could combine those, couldn't we?

2 more replies

LordDragonfang2y ago

TBH, a properly-administered polygraph is probably more accurate than OpenAI's detector (of course, "properly administered" requires the subject to be cooperative and answer very simple yes or no questions, because a poly measures subconscious anxiety, not "truth")

1 more reply

BestGuess2y ago

Taking away tools don't seem to me like the best response same way taking away things tends never to be. If the problem is people not using it right, that seems to me like it would be designed wrong for what people need it for. Like if the issue is using it wrong with too little sentences, then put a minimum sentence or something to have that minimum likelihood.

Same goes for representing what it means. If people don't understand statistics or math and such, then show what it means with circles or coins or stuff like that. Point is don't seem ever a good thing for options to get removed, especially if it's for bein cynical and judgin people like they're beneath deservin it. Don't make no sense.

insanitybit2y ago

The problem isn't people not using it right, the problem is that the tool can never work and just by being out in the world it would cause harm.

If I have a tool that returns a random number between 0 and 1, indicating confidence that text is AI generated, is that tool good? Is it ethical to release it? I'd say no, it isn't. Removing the option is far better because the tool itself is harmful.

2 more replies

cjbgkagh2y ago

There are numerous people that I’ve tried to get them comprehend statistics, important medical statistics for doctors so you would assume they’re smart enough to understand. There just seems to be a sufficient subset of the population that are blind to statistics and nothing can be done about it. Even sitting down and carefully going through the math with them doesn’t work. No matter how deep into visualization rabbit hole you go there will still be a subset that will not get it.

1 more reply

RandomLensman2y ago

I think it will be increasingly irrelevant what specific process generated a text, for example. Already before genAI people did not in general query into how politicians' speeches were crafted etc.

2 more replies

andy992y ago

> The amount of people in the ecosystem who thinks it's even possible to detect if something is AI written or not when it's just a couple of sentences is staggering high.

I saw that this report came out today which frankly is baffling: https://gpai.ai/projects/responsible-ai/social-media-governa... (Foundation AI Models Need Detection Mechanisms as a Condition of Release [pdf])

specproc2y ago

I'm still interested in this line of enquiry.

These models are clearly not good enough for decision-making, but still might tell an interesting story.

Here's an easily testable exercise: get a load of news from somewhere like newsapi.ai, run it through an open model and there should be a clear discontinuity around ChatGPT launch.

We can assume false positives and false negatives, but with a fat wadge of data we should still be able to discern trends.

Certainly couldn't accuse a student of cheating with it, but maybe spot content farms.

xattt2y ago

I see no reason why watermarking can’t be broken by having someone simply rephrase/redraw the output.

Yes, it’s still work, but it’s one step removed from having to think up of the original content.

whimsicalism2y ago

Watermarking was never going to be successful except for the most naive uses.

1 more reply

nancyhn2y ago

I agree, transparency is essential, especially when it comes to AI applications. Many underestimate the complexity of distinguishing AI-written content from human-written, especially for short texts. There's a danger in trusting tools claiming to provide absolute certainty in this regard; no current technology can guarantee 100% accuracy. This incident underscores the need for a more realistic understanding of AI capabilities and limitations in text generation detection.

nradov2y ago

Dr. Michio Kaku claimed in an interview that it may eventually be possible for quantum computers to guarantee a certain level of truthfulness. I didn't really follow his argument and it seemed a little hand-wavey, but I can't prove that he's wrong.

https://mkaku.org/home/tag/quantum-computing/

c-cube2y ago

No need to prove! Any well calibrated bullshit detector should ring at 110dB when applied to claims mixing AI and quantum computing.

(that said, "may eventually be possible" is so weak a claim it's already meaningless. Quantum fluctuations may eventually turn me into a potato but it's not keeping me up at night)

airjtaiwjt2y ago

Based on my experience from grad school, I would bet plenty of the professors who fail students because ChatGPT said ChatGPT might have written something honestly don't care whether it's true or not, as long as it shifts liability away from themselves onto someone else

amelius2y ago

They could certainly keep a database of things generated by /their/ AI ...

gmerc2y ago

Which would be trivially broken with emojis injection or viewpoint shifting.

1 more reply

burtonator2y ago

It's really disturbing to me how many people don't realize it's not possible.

It's like asking a 747 to be made into a dog.

It's completely nonsensical to me.

hackernewds2y ago

Good. If it is not reliable, it's a further harm than good if it exists as a false security.

An analogous example: my local pizza delivery (where I worked) would shut the box with a safety sticker, to avoid tampering / dipping by the delivery boys. Now, sometimes they would forget to do this for various logistical reasons. Every one of the non-stickered ones started getting returned as customers worried a pepperoni stolen. They stopped doing it shortly after.

klabb32y ago

It’s a law of nature that pepperoni thieves cannot take a job at a pizza place. They are forever doomed to be delivery guys.

boondoggle162y ago

This is actually kinda true with doordash etc. Those drivers are completely unvetted, they don't even have an interview.

The kind of people that can't get a job at a pizza place.

Personally, I never order delivery through these services. The incentives are all wrong. Not to mention the costs are super high: restaurants don't make any money, I pay out the @$$, and the drivers are given sub-minimum-wage pay after taking on the risks of delivery driving.

pixl972y ago

Eh, I'd consider that a failure of employee training and reverse the situation by giving out a weekly bonus to shifts that did not fail to put the security stickers on.

Kinda like if they forgot to put the security seal on your aspirin, I'm not going to take them all off because someone forgot to run production with all the bottles sealed.

frumper2y ago

The bottle of Aspirin goes through many hands between the manufacturer and you including sitting on an unattended shelf open to the public. The person making the pizza is working for the same company as the person delivering it, or may even be the same person. If you can't trust the pizza co delivery person then you probably shouldn't trust the person making it either.

2 more replies

mplewis2y ago

Frankly, the kind of person who forgets to put the sticker on at the pizza place will forget about the bonus too.

chakintosh2y ago

This tool's been fueling tons of false accusations in academia. Wife is doing her PhD and she often tells me stories about professors falsely accusing students of using ChatGPT.

practice92y ago

Lots of stories on Reddit about school teachers unfairly accusing students of using ChatGPT for assignments too

Qqqwxs2y ago

A group of students at my university were claiming their papers were being marked by a LLM. They cited a classifier like the OP which they used on their feedback comments.

paxys2y ago

It isn't just this one. There are a hundred different "AI detectors" sold online that are all basically snake oil, but overzealous professors and school administrators will keep paying for them regardless.

VeninVidiaVicii2y ago

Eh I am doing my PhD and I use ChatGPT all the time!

gigglesupstairs2y ago

And?

The tool in question was used for AI text detection not generation.

1 more reply

Grimblewald2y ago

whats funny is GPT output with even a relatively low temp will come back as human more reliably than human content.

jedberg2y ago

Latest I heard is that teachers are requiring homework to be turned in in Google Docs so that they can look at the revision history and see if you wrote the whole thing or just dumped a fully formed essay into GDocs and then edited it.

Of course the smart student will easily figure out a way to stream the GPT output into Google Docs, perhaps jumping around to make "edits".

A clever and unethical student is pretty much undetectable no mater what roadblocks you put in their way. This just stops the not clever ones. :)

andromaton2y ago

Proof of work, human version.

Yes, anybody can write an agent to meander about typing the chatgpt generated text into Google docs. Yes, Google could judge how likely it's that a document was typed by a human, but they won't for the same reasons openAI just cancelled this.

Somebody (maybe reacting to this news, maybe reading this thread) will write such an editor or evaluator. Another solution is screen recording as you write. Another (the best one, and the hardest one for educators) is to not request or grade things a robot can write better than most humans.

marcell2y ago

This will be hard to break. It’s basically an hour long CAPTCHA. You can look at things like key stroke timing, mouse movement, revision pattern, etc. I don’t see LLM’s breaking this approach to classify human writing.

jacquesm2y ago

> I don’t see LLM’s breaking this approach to classify human writing.

Why not? Record a bunch of humans writing, train model, release. That's orders of magnitude simpler than to come up with the right text to begin with.

1 more reply

TulliusCicero2y ago

Seems easy to me? Just manually copy the text by typing it in yourself. There may be certain patterns that could potentially give it away vs truly human-generated text, but will Google Docs revision history show that level of detail?

ribosometronome2y ago

Retyping the essay from the chatgpt while actively rewording the occassional sentence seems like it would do it.

wmeredith2y ago

It seems like that's nearing the sweet spot of fraud prevention, where committing the act of fraud is as much work as doing the real thing.

4 more replies

mplewis2y ago

It's a bit suspicious to type an essay linearly from start to finish, though.

3 more replies

justrealist2y ago

Well, there is one way, which is timed, proctored exams.

Which sucks, because take-home projects are evaluating a different skill set, and some people thrive on one vs the other. But it is what it is.

matteoraso2y ago

>Of course the smart student will easily figure out a way to stream the GPT output into Google Docs, perhaps jumping around to make "edits".

No need to complicate it that much. Just start off writing an essay normally, and then paste in the GPT output normally. A teacher probably isn't going to check any of the revision history, especially if there's more than 30 students to go through.

jwie2y ago

Easy thing to do would be very low maximum word counts. ChatGPT is incapable of brevity.

burtonator2y ago

It's like hearing stories about people hating the first automobiles because they weren't horses.

The education bubble is about to implode - it will probably be one of the first industries killed by AI.

13years2y ago

"Half a year later, that tool is dead, killed because it couldn’t do what it was designed to do."

This was my conclusion as well testing the image detectors.

Current automated detection isn’t very reliable. I tried out Optic’s AI or Not , which boasts 95% accuracy, on a small sample of my own images. It correctly labeled those with AI content as AI generated, but it also labeled about 50% of my own stock photo composites I tried as AI generated. If generative AI was not a moving target I would be optimistic such tools could advance and become highly reliable. However, that is not the case and I have doubts this will ever be a reliable solution.

from my article on AI art - https://www.mindprison.cc/p/ai-art-challenges-meaning-in-a-w...

danuker2y ago

> but it also labeled about 50% of my own stock photo composites I tried as AI generated

Could it be that a large proportion of the source stock photos were actually AI generated?

13years2y ago

No, they were older images. However, that is now becoming a problem. Some stock photo sites now have AI images and they are not labeled. I'm able to distinguish most for now because at hires the details contain obvious errors.

This is really painful, because for some of my work I need high quality images suitable for print. Now I can't just look at the thumbnail and say "this will work". I now have to examine it taking more of my time.

sakopov2y ago

Humorously, in my experience, if a response from ChatGPT ever got classified as AI generated by tools like ZeroGPT or similar, all I had to do was adjust the prompt to tell the model not to sound like it was AI generated and that bypassed all detection with a very high success rate. Additionally, I also found that if you prompt it to make the response be in the style or some known writer for example, it often made responses 100% human written by most AI detection models.

klabb32y ago

“Can’t you just try to blend in and be a little more cool? The bouncer is gonna notice.”

Starts talking like Shakespeare

muzani2y ago

This didn't work half a year ago. They'd still accuse the rewritten text to be AI generated. I think some of the recent updates have changed the tone of ChatGPT so significantly that they no longer register on the radar.

rootusrootus2y ago

Good. And I think watermarking AI output is also a dead end. Better that we simply assume that all content is fake unless proven otherwise. To the extent that we need trustworthy photos, it seems like a better idea to cryptographically sign images at the hardware level when the photo is taken. Voluntarily watermarking AI content is completely pointless.

sebzim45002y ago

I can see that working for specialized equipment like police body cameras, but if every camera manufacturer in the world needs to manage keys and install them securely into their sensors then there will be leaked keys within weeks.

tudorw2y ago

Makes fake image, hold it in front of camera, click, verified image...

1 more reply

ummonk2y ago

Just use a certificate chain. The manufacturer can provide each camera its own private key, signed by the manufacturer.

1 more reply

cwkoss2y ago

Cryptography cant save us here, people will figure out how to send AI images to the crypto hardware to get it signed in months. Just would be another similar layer of false security.

malfist2y ago

That's not how cryptographic signing works.

Cryptographic signing means "I wrote this" or "I created this". Sure you could sign an AI generated image as yourself. But you could not sign an image as being created by Getty or NYT

1 more reply

baby_souffle2y ago

> Cryptography cant save us here, people will figure out how to send AI images to the crypto hardware to get it signed in months.

Possibly (who am I kidding. *PROBABLY*!) will use chatGPT to help them design the method :)

TheCaptain48152y ago

I'm in the SEO game and I've spoken with some 'heavy players' who believe a "Google Ai Update" is in the works. As it currently stands, the search engine results will be completely overtaken by Ai content in the near future without this.

From my understanding, this is a fools play in the long run, but there are current Ai Classifier Detectors that can successfully detect ChatGPT and other models (Originality.ai being a big one) on longish content.

Their process is fairly simple, they create a classification model after generating tons of examples from all the major models (ChatGPT, GPT4, Laama, etc).

One obvious downside to their strategy is the implementation of Finetuning and how that changes the stylistic output. This same 'heavy hitter' has successfully bypassed Originalities detector using his specified finetuning method (which he said took months of testing and thousands of dollars).

bearjaws2y ago

Google needs to do a full 180, and only the most succinct website that answers search queries should be elevated.

The current state of Google is a disaster, everything is 100 paragraphs per article, the answer you are looking for buried half way in to make sure you spend more time and scroll to appease the algorithm.

I cannot wait for them to sink all these spam websites.

pixl972y ago

Waiting for Google to do that won't happen, they'd lose too many ad links.

albert_e2y ago

Many comments here seem to suggest that practically it is going to be impossible to classify text as Human generated versus AI generated -- given the many different ways in which such attempts can be foiled in a never ending game of cat-and-mouse.

If we accept this ...

The challenge I am foreseeing is this:

We are only at the very beginning of the AI revolution -- and if LLMs need to get more sophisticated and powerful in future they will need good-quality human-generated / curated training data at a scale that is likely impossible to do manual curation/cleansing/quality-checks on.

And there is no doubt that evey medium is going to get bombarded and spammed with AI-generated content in coming years.

How then, are we going to filter the data -- to separate the real data from AI generated noise -- to train future LLMs on -- and really push them to their potential.

This problem has been bugging me for a while and I commented here previously as well, tentatively calling it 'Data Pollution' for the lack of a better word.

Curious to hear other perspectives on this.

kobalsky2y ago

curation? I mean of the content is considered good, does it matter if it was produced by an LLM?

spandrew2y ago

The only way to prevent AI from answering questions in digital platforms is to develop a ML db on the typing style of every student across their tenure at an institution. Good luck getting that approved — departments can't even access grade or demo data without a steering group going through a 3-deep committee process.

¯\_(ツ)_/¯ try paper I guess. Time to brush up on our OCR.

humanistbot2y ago

If AI can replicate linguistic patterns in a way that is undetectable for both humans and models, then it seems even easier for a ML model to emulate a natural typing style, rhythm, and cadence in a way that is undetectable for both humans and models.

But you know who has more real-world data on typing style? Google, Microsoft, Meta, and everyone else who runs SaaS docs, emails, or messaging. I imagine a lot of students write their essays on Google Docs, Word, or the like, and submit them as attachments or copy-paste into a textbox.

JohnMakin2y ago

We shattered the turing test and now we want to put it back into pandora’s box because we don’t like the repercussions.

CatWChainsaw2y ago

The techbros were so focued on whether or not they could that they accused anyone who asked if they should of being a luddite. And dinner that night was crow, and dessert was humble pie.

callalex2y ago

“We shattered the Turing test” The Turing test is typically an interactive back-and-forth, no? Detecting if paragraphs of pre-written LLM output and giving a yes-or-no answer is not the same thing.

pixl972y ago

We asked the question 'can we beat the turning test', not what would happen when we did.

yk2y ago

Funny incentive problem, OpenAI obviously has an incentive to use it's best AI detection tool for adversarial training, with the result that it's detection tool will not be very good against chatGPT generated text because it is trained to defeat the detection tool.

mitthrowaway22y ago

If they wanted to disguise their content as coming from a human, they probably wouldn't keep saying "As an AI chatbot, I can't..."

yk2y ago

I'm not saying that they are trying to disguise, I'm saying that their goal is natural language and a way you can distinguish chatGPT (aside from trivial things like "as an ai chatbot...") from human speech is most likely a way to improve chatGPT, because the goal is speaking like a human.

jillesvangurp2y ago

This kind of thing strikes me in any case as something that's only good for the generation of AI it's been trained against. And with the exponential improvements happening almost on a monthly basis, that becomes obsolete pretty quickly and a bit of a moving target.

Maybe a better term would be Superior Intelligence (SI). I sure as hell would not be able to pass any legal or medical exams without dedicating the next decade or so to getting there. Nor do I have any interest in doing so. But chat gpt 4 is apparently able to wow its peers. Does that pass the Turing test because it's too smart or too stupid? Most of humanity would fail that test.

anshumankmr2y ago

I read a post on LinkedIn by Timnit Gebru, where she shared an anonymous post by a student who said that their paper was being rejected due to having seemingly high AI written content. Their paper was evaluated by Turnitin who claim to have a built an AI detector. The student mentioned it was putting their career at risk, and they hadn't actually taken the help of any AI and was actually a 4.0 student throughout their college career.

So assuming all that to be true, how can the likes of Turnitin claim to be an authority for AI writing detection. When I graduated a few years back, they used to offer plag check only.

ShamelessC2y ago

> how can the likes of Turnitin claim to be an authority for AI writing detection

Pretty easy - they lie to people.

kylecordes2y ago

There is inherent conflict in having both an AI tool business and an AI tool detection business.

If the first does a good job, the second fails. And vice versa.

(On the other hand, maybe there is a lot of money to be made selling both, to different groups?)

sebzim45002y ago

I don't think this follows. If they wanted, they could crypographically bias the sampling to make the output detectable without decreasing capabilities at all.

Only people using it deceptively would be affected. No idea what portion of ChatGPT's users that is, would be very interested to know.

zarzavat2y ago

There’s a much more effective way: store hashes of each output paragraph (with a minimum entropy) that has ever been generated, and allow people to enter a block of text to search the database.

It wouldn’t beat determined users but it would at least catch the unaware.

1 more reply

wouldbecouldbe2y ago

If the goal is students, then the best would be a tool not only detects AI. But where you can submit previous writing and see how likely it is they wrote a similar text, not so much if it was llm generated.

mercurialsolo2y ago

I wonder why we need this very thing of AI generated. It's a luddite view of AI. Much like the need to distinguish between handcrafted versus machined products - is there a real utility to knowing this?

For educators looking at evaluating students, essays and the like - we possibly need different ways of evaluation rather than on written asynchronous content for communicating concepts and ideas.

pixl972y ago

>is there a real utility to knowing this?

For civics, I would say yes.

Imagine you were talking to an online group about a design project for a local neighborhood. Based on the plurality of voices it seemed like mist people wanted a brown and orange design. But later when you talk to actual people in real life, you could only find a few that actually wanted that.

Virtual beings are a great addition to the bot nets that generate false consensus.

klabb32y ago

I believe you’re exactly right. It would be similar to detecting that math homework used wolfram alpha or even a calculator.

bcherny2y ago

Timing seems related to Altman launching Worldcoin (whose goal is to reliably differentiate between human and AI generated content).

https://www.reuters.com/technology/openais-sam-altman-launch...

hospitalJail2y ago

Not to say "I can detect chatgpt" but it sure seems to have a similar way of talking even when I say things like: Talk like a "Millennial male who is obsessed with Zelda, their name is bob zelenski"

Now the topic isnt about anything millennial or Zelda related, but I'd think that the language model would select sentence and paragraph phrasing differently.

Maybe I need to switch to the API.

post-it2y ago

I've also noticed that ChatGPT tends to respond to short prompts, especially questions, in a predictable format. There are a few characteristics.

First, it tends to print a five-paragraph essay, with an introduction, three main points, and a conclusion.

Second, it signposts really well. Each of the body paragraphs is marked with either a bullet point or a number or something else that says "I'm starting a new point."

Third, it always reads like a WikiHow article. There's never any subtle humour or self-deprecation or ironic understatement. It's very straightforward, like an infographic.

It's definitely easy to recognize a ChatGPT response to a simple prompt if the author hasn't taken any measures to disguise it. The conclusion usually has a generic reminder that your mileage may vary and that you should always be careful.

SquareWheel2y ago

I have to admit I'm struggling to tell if this was done ironically, but your comment is exactly a five paragraph essay with an introduction, three main points, and a conclusion.

If so, nice meta-commentary.

1 more reply

lacker2y ago

Smart of OpenAI to shut down a tool that basically doesn't work before the school year starts and students start to get in trouble based on it.

I think this upcoming school year is going to be a wakeup call for many educators. ChatGPT with GPT-4 is already capable of getting mostly A's on Harvard essay assignments - the best analysis I have seen is this one:

https://www.slowboring.com/p/chatgpt-goes-to-harvard

I'm not sure what instructors will do. Detecting AI-written essays seems technologically intractable, without cooperation from the AI providers, who don't seem too eager to prioritize watermarking functionality when there is so much competition. In the short term, it will probably just be fairly easy to cheat and get a good grade in this sort of class.

woeirua2y ago

Nah, everything is just going to be proctored exams on paper in the future. Sucks for the pro take-home project crowd, but they ruined it for themselves.

PeterStuer2y ago

It's an open ended red-queen problem. You can't win.

Besides, even if they did win, they would still lose by shooting their own foot.

bilater2y ago

As a joke I built a simple tool that swaps random words for their synonym and it did the trick in throwing off any distribution matching (came out with gibberish but still lol) https://www.gptminus1.com/

system22y ago

Fascinating. I tried zerogpt.com for testing a text I generated. Got 91% AI generated response. Used gptminus1.com and now zerogpt.com shows 0% AI. These detectors are garbage.

Roark662y ago

I wonder if this entire "AI cheating" stuff will end up as calculators did when I was growing up. When I was a young child in school calculators were forbidden. You had to learn multiplications in your head. By the time I got to the final exam at the end of our "high school" equivalent we could use "dumb" calculators in math exams. Few years later graphing calculators became accepted in schools (sometimes even required).

It is important humans learn to express themselves in writing. The only way I think this will happen is if kids do their writing at school supervised.

squarefoot2y ago

How could we have both the AI that is indistinguishable from humans and the AI that can detect it with good accuracy? That would imply the race on both sides for an AI that is more intelligent than an high IQ human.

jerf2y ago

Text is a very high dimensional space, n-dimensional in fact. There is plenty of room for an AI to leave a fingerprint that can be detected in some ways but not others.

In fact it doesn't take much text to distinguish between two human beings. The humanly-obvious version is that someone that habitually speaks in one dialect and someone else in another must be separate, but even without such obvious tells humans separate themselves into characterizeable subsets of this space fairly quickly.

I'm skeptical about generalized AI versus human detection in the face of the fact that it is adversarial. But a constant, unmoving target of some specific AI in some particular mode would definitely be detectable; e.g., "ChatGPT's current default voice" would certainly be detectable, "ChatGPT when instructed to sound like Ernest Hemmingway" would be detectable, etc. I just question whether ChatGPT in general can be characterized.

rhyme-boss2y ago

This should have been rejected as an idea just on its face. False positives are really problematic. And if it performs unexpectedly well (accuracy is high) then it just becomes a training tool for reinforcement learning.

bloppe2y ago

Rather than try to detect if pieces were generated by AI, why not just check if they're plagiarized off the outputs of a bunch of popular models? We already have traditional plagiarism analysis methods, we just need a corpus of most of the recent outputs from the most popular LLM services to check against. OpenAI, Google, Anthropic, and any other LLM-as-a-service companies could profitably sell access to these corpora to third-party analysis services that compile them and offer search.

ShamelessC2y ago

One reason would be that LLM's don't tend to output verbatim pieces from their dataset (unless explicitly prompted to do so). This is made further complicated by the "temperature" setting allowing users to make output even more "creative" than its dataset.

In OpenAI's case, its writing style usually comes from OpenAI's in-house dataset they used for RLHF. This is what gives it the ability to chat and respond with its signature (perhaps overly formal and apologetic) tone.

Although it can be used to write in other styles, sometimes it will refuse to because of this.

bloppe2y ago

You misunderstand me. I'm not talking about detecting when LLM's plagiarize humans, but when humans "plagiarize" (copy from without attributing) LLM's.

torginus2y ago

There's an exceedingly simple way of doing this that's pretty much bullet proof: Just check the given text against the database of stored generations (which they no doubt keep) and you'll have a pretty much perfect result. Logistically, searching untold terabytes of text might be challenging but to say there's fundamental difficulties is just not accurate imo.

Grimblewald2y ago

that's the problem - no you won't. The problem is you've bought too much into the stochastic parrot view of LLM's to a point where you are viewing it as a fancy content storage/retrieval system.

kmeisthax2y ago

The idea that OpenAI was intentionally watermarking its output to avoid training data back-contamination should be thoroughly discredited now.

judcnegri2y ago

I think educators got away for too long of handling the problem of umotivated students and uninteresting classes subjects...

Not the educators fault tough, more like the system is bad.

My point is that given knowledge is mostly free and available, the system should teach the students to think rather than using tools or remembering facts

al_be_back2y ago

low accuracy is certainly a good reason to drop a project, especially when dealing with small text (<1000 chars), this is where most social media post/mini-blogs fall under.

bigger text e.g. reports, thesis etc are probably easier & cheaper to verify by humans, with help of A.I. tools (ref checking, searching...)

thepaulthomson2y ago

Wow, OpenAI pulling the plug on its AI Classifier tool really shows how tough it is to spot the difference between human and AI-generated text. It's a bummer that it didn't work out as planned, but hey, that's how we learn, right?

ShamelessC2y ago

No disrespect, but...is this comment AI generated?

vorticalbox2y ago

Could we not add invisible characters into the text a bit like a water mark?

nomel2y ago

Yes, and we could just as easily remove them.

vorticalbox2y ago

True but I feel it would catch a few people out?

tester4572y ago

We could but doing so would lobotomize it.

seeknotfind2y ago

Watermarking is a more tractable approach, but the cat is out of the bag.

sydon2y ago

How would you go about watermarking AI written text?

thewataccount2y ago

https://arxiv.org/pdf/2301.10226.pdf

Here's a decent paper on it.

It covers private watermarking (you can't detect it exists without a key), resistance to modifications, etc. Essentially you wouldn't know it was there and you can't make simple modifications to fool it.

OpenAI could already be doing this, and they could be watermarking with your account ID if they wanted to.

The current best countermeasure is likely paraphrasing attacks https://arxiv.org/pdf/2303.11156.pdf

doctorpangloss2y ago

I don't know.

I suppose hosted solutions like ChatGPT could offer an API where you copy some text in, and it searches its history of generated content to see if anything matches.

> bUt aCtuAlLy...

It's not like I don't know the bajillion limitations here. There are many audiences for detection. All of them are XY Problems. And the people asking for this stuff don't participate on Hacker News aka Unpopular Opinions Technology Edition.

There will probably be a lot of "services" that "just" "tell you" if "it" is "written by an AI."

cateye2y ago

One way is trying to sneak in a specific structure/pattern that is difficult for a human to notice when reading, like using a particular sentence length, paragraph length, or punctuation pattern. Or use certain words in the text that may not be frequently used by humans etc.

Watermarking needs to be subtle enough to be unnoticeable to opposing parties, yet distinctive enough to be detectable.

So, this is an arms race especially because detecting it and altering it based on the watermark is also fun :)

1 more reply

brucethemoose22y ago

Make half of the tokens (the AI's "dictionary") slightly more likely.

This would not impact output quality much, but it would only work for longish outputs. And the token probability "key" could probsbly be reverse engineered with enough output.

1 more reply

taneq2y ago

type=text/chatgpt :P

merlincorey2y ago

Invisible characters in a specific bit-pattern.

Pretty common steganographic technique, really.

2 more replies

mepian2y ago

If it's generated by a SaaS, the service could sign all output with a public key.

3 more replies

cwkoss2y ago

only tractable for closed source hosted LLMs

stormed2y ago

Interesting. I was under the impression this tool was effective because of some sort of hidden patterns generated in sentences. I guess my assumption was way complex than what it actually is

islammidov2y ago

It's very interesting actually. I think from business perspective SEO is the most exposed. Very curious how Google will (or may be now) solve the issue?

atleastoptimal2y ago

Even if AI detectors were 99% effective, anyone could just iterate over an AI produced piece of writing until it's in the 1% that isn't detected and submit it.

janalsncm2y ago

On a related note, I built this site as a PSA:

https://isthiswrittenbyai.surge.sh

partiallypro2y ago

If we can't tell what is written by AI, how will we know if AI is just stuck in a feedback loop in which it references itself (and is wrong)?

cutler2y ago

I urge anyone who values data privacy to refuse to use sites such as this which employ third party data services, ie. popups, which purport to allow you to "Manage" your consent but which hide a long list of opt-out "Legitimate Interest" flags in a "Vendors" list concealed at the bottom of another scrolling list.

ChatGTP2y ago

We're entering a wild world. Our information space is up for the slaughter.

nektro2y ago

it was doomed to fail from the start. being detectable is precisely how the bot is trained. so the moment a tool figures out a classifier, a new generation that undetectable springs up.

hayd2y ago

Was this what Stack Overflow were using to detect automated answers?

Ukv2y ago

None were officially built into the site so it'll vary from moderator to moderator, but the one that mods had a browser script made for to help streamline moderation was RoBERTa Base OpenAI Detector from 2019, created prior to the existence of GPT-3, GPT-3.5 (ChatGPT free), or GPT-4 (ChatGPT pro). It'll be far worse than the 2023 one this article is about.

BigElephant2y ago

No official announcement is lame

system22y ago

No lamer than dropping davinci.

j / k navigate · click thread line to collapse

283 comments

capableweb2y ago

I'm glad that they did, although they should obviously done an announcement for it.

So hopefully this can serve as another example that it's simply not possible to detect if a bunch of characters were outputted by an LLM or not.

feoren2y ago

asddubs2y ago

1 more reply

tessierashpool2y ago

could you contextualize your use of the word "easily" here?

I feel like "easily" might mean "with infinite funds and frictionless spherical developers."

1 more reply

sobellian2y ago

Even without a GAN, it's quite possible that one could simply write `while(rejects(output)) { output = gpt(prompt) }` and obtain a sufficient output after a reasonable number of iterations.

dreamcompiler2y ago

You just outlined an excellent proof of what might be called the AI Halting Problem.

constantcrying2y ago

Even the idea of it is bad, ChatGPT is supposed to write indistinguishably from a human.

bko2y ago

Copying a comment I posted a while ago:

Some work arounds he hasn't figured out yet is asking for an output in language X and then translating it into language Y. But those may still be eventually figured out.

I think watermarking would be a big step forward to practical AI safety and ideally this method would be adopted by all major LLMs.

That part starts around 1 hour 25 min in.

https://axrp.net/episode/2023/04/11/episode-20-reform-ai-ali...

5 more replies

siglesias2y ago

2 more replies

insanitybit2y ago

toss12y ago

Yup.

Moreover, the way to deal with AI in this context is not like the way to deal with plagiarism; do not try to detect AI and punish its use.

1 more reply

Teever2y ago

Nitpick: ChatGPR is supposed to write in a way that is indistinguishable from a human, to another human.

That doesn't mean that it can't be distguishable by some other means.

3 more replies

WiSaGaN2y ago

RandomLensman2y ago

Why even care if it is written by a machine or not? I am not sure it matters as much as people think.

3 more replies

catboybotnet2y ago

There's also the post going around about how it can (and does) falsely flag human posts as AI output, particularly among some autistic people. About as useful as a polygraph, no?

capableweb2y ago

3 more replies

hef198982y ago

We could combine those, couldn't we?

2 more replies

LordDragonfang2y ago

1 more reply

BestGuess2y ago

insanitybit2y ago

The problem isn't people not using it right, the problem is that the tool can never work and just by being out in the world it would cause harm.

2 more replies

cjbgkagh2y ago

1 more reply

RandomLensman2y ago

I think it will be increasingly irrelevant what specific process generated a text, for example. Already before genAI people did not in general query into how politicians' speeches were crafted etc.

2 more replies

andy992y ago

> The amount of people in the ecosystem who thinks it's even possible to detect if something is AI written or not when it's just a couple of sentences is staggering high.

specproc2y ago

I'm still interested in this line of enquiry.

These models are clearly not good enough for decision-making, but still might tell an interesting story.

Here's an easily testable exercise: get a load of news from somewhere like newsapi.ai, run it through an open model and there should be a clear discontinuity around ChatGPT launch.

We can assume false positives and false negatives, but with a fat wadge of data we should still be able to discern trends.

Certainly couldn't accuse a student of cheating with it, but maybe spot content farms.

xattt2y ago

I see no reason why watermarking can’t be broken by having someone simply rephrase/redraw the output.

Yes, it’s still work, but it’s one step removed from having to think up of the original content.

whimsicalism2y ago

Watermarking was never going to be successful except for the most naive uses.

1 more reply

nancyhn2y ago

nradov2y ago

https://mkaku.org/home/tag/quantum-computing/

c-cube2y ago

No need to prove! Any well calibrated bullshit detector should ring at 110dB when applied to claims mixing AI and quantum computing.

(that said, "may eventually be possible" is so weak a claim it's already meaningless. Quantum fluctuations may eventually turn me into a potato but it's not keeping me up at night)

airjtaiwjt2y ago

amelius2y ago

They could certainly keep a database of things generated by /their/ AI ...

gmerc2y ago

Which would be trivially broken with emojis injection or viewpoint shifting.

1 more reply

burtonator2y ago

It's really disturbing to me how many people don't realize it's not possible.

It's like asking a 747 to be made into a dog.

It's completely nonsensical to me.

hackernewds2y ago

Good. If it is not reliable, it's a further harm than good if it exists as a false security.

klabb32y ago

It’s a law of nature that pepperoni thieves cannot take a job at a pizza place. They are forever doomed to be delivery guys.

boondoggle162y ago

This is actually kinda true with doordash etc. Those drivers are completely unvetted, they don't even have an interview.

The kind of people that can't get a job at a pizza place.

pixl972y ago

Eh, I'd consider that a failure of employee training and reverse the situation by giving out a weekly bonus to shifts that did not fail to put the security stickers on.

Kinda like if they forgot to put the security seal on your aspirin, I'm not going to take them all off because someone forgot to run production with all the bottles sealed.

frumper2y ago

2 more replies

mplewis2y ago

Frankly, the kind of person who forgets to put the sticker on at the pizza place will forget about the bonus too.

chakintosh2y ago

This tool's been fueling tons of false accusations in academia. Wife is doing her PhD and she often tells me stories about professors falsely accusing students of using ChatGPT.

practice92y ago

Lots of stories on Reddit about school teachers unfairly accusing students of using ChatGPT for assignments too

Qqqwxs2y ago

A group of students at my university were claiming their papers were being marked by a LLM. They cited a classifier like the OP which they used on their feedback comments.

paxys2y ago

VeninVidiaVicii2y ago

Eh I am doing my PhD and I use ChatGPT all the time!

gigglesupstairs2y ago

And?

The tool in question was used for AI text detection not generation.

1 more reply

Grimblewald2y ago

whats funny is GPT output with even a relatively low temp will come back as human more reliably than human content.

jedberg2y ago

Of course the smart student will easily figure out a way to stream the GPT output into Google Docs, perhaps jumping around to make "edits".

A clever and unethical student is pretty much undetectable no mater what roadblocks you put in their way. This just stops the not clever ones. :)

andromaton2y ago

Proof of work, human version.

marcell2y ago

jacquesm2y ago

> I don’t see LLM’s breaking this approach to classify human writing.

Why not? Record a bunch of humans writing, train model, release. That's orders of magnitude simpler than to come up with the right text to begin with.

1 more reply

TulliusCicero2y ago

ribosometronome2y ago

Retyping the essay from the chatgpt while actively rewording the occassional sentence seems like it would do it.

wmeredith2y ago

It seems like that's nearing the sweet spot of fraud prevention, where committing the act of fraud is as much work as doing the real thing.

4 more replies

mplewis2y ago

It's a bit suspicious to type an essay linearly from start to finish, though.

3 more replies

justrealist2y ago

Well, there is one way, which is timed, proctored exams.

Which sucks, because take-home projects are evaluating a different skill set, and some people thrive on one vs the other. But it is what it is.

matteoraso2y ago

>Of course the smart student will easily figure out a way to stream the GPT output into Google Docs, perhaps jumping around to make "edits".

jwie2y ago

Easy thing to do would be very low maximum word counts. ChatGPT is incapable of brevity.

burtonator2y ago

It's like hearing stories about people hating the first automobiles because they weren't horses.

The education bubble is about to implode - it will probably be one of the first industries killed by AI.

13years2y ago

"Half a year later, that tool is dead, killed because it couldn’t do what it was designed to do."

This was my conclusion as well testing the image detectors.

from my article on AI art - https://www.mindprison.cc/p/ai-art-challenges-meaning-in-a-w...

danuker2y ago

> but it also labeled about 50% of my own stock photo composites I tried as AI generated

Could it be that a large proportion of the source stock photos were actually AI generated?

13years2y ago

sakopov2y ago

klabb32y ago

“Can’t you just try to blend in and be a little more cool? The bouncer is gonna notice.”

Starts talking like Shakespeare

muzani2y ago

rootusrootus2y ago

sebzim45002y ago

tudorw2y ago

Makes fake image, hold it in front of camera, click, verified image...

1 more reply

ummonk2y ago

Just use a certificate chain. The manufacturer can provide each camera its own private key, signed by the manufacturer.

1 more reply

cwkoss2y ago

Cryptography cant save us here, people will figure out how to send AI images to the crypto hardware to get it signed in months. Just would be another similar layer of false security.

malfist2y ago

That's not how cryptographic signing works.

Cryptographic signing means "I wrote this" or "I created this". Sure you could sign an AI generated image as yourself. But you could not sign an image as being created by Getty or NYT

1 more reply

baby_souffle2y ago

> Cryptography cant save us here, people will figure out how to send AI images to the crypto hardware to get it signed in months.

Possibly (who am I kidding. *PROBABLY*!) will use chatGPT to help them design the method :)

TheCaptain48152y ago

Their process is fairly simple, they create a classification model after generating tons of examples from all the major models (ChatGPT, GPT4, Laama, etc).

bearjaws2y ago

Google needs to do a full 180, and only the most succinct website that answers search queries should be elevated.

I cannot wait for them to sink all these spam websites.

pixl972y ago

Waiting for Google to do that won't happen, they'd lose too many ad links.

albert_e2y ago

If we accept this ...

The challenge I am foreseeing is this:

And there is no doubt that evey medium is going to get bombarded and spammed with AI-generated content in coming years.

How then, are we going to filter the data -- to separate the real data from AI generated noise -- to train future LLMs on -- and really push them to their potential.

This problem has been bugging me for a while and I commented here previously as well, tentatively calling it 'Data Pollution' for the lack of a better word.

Curious to hear other perspectives on this.

kobalsky2y ago

curation? I mean of the content is considered good, does it matter if it was produced by an LLM?

spandrew2y ago

¯\_(ツ)_/¯ try paper I guess. Time to brush up on our OCR.

humanistbot2y ago

JohnMakin2y ago

We shattered the turing test and now we want to put it back into pandora’s box because we don’t like the repercussions.

CatWChainsaw2y ago

The techbros were so focued on whether or not they could that they accused anyone who asked if they should of being a luddite. And dinner that night was crow, and dessert was humble pie.

callalex2y ago

pixl972y ago

We asked the question 'can we beat the turning test', not what would happen when we did.

yk2y ago

mitthrowaway22y ago

If they wanted to disguise their content as coming from a human, they probably wouldn't keep saying "As an AI chatbot, I can't..."

yk2y ago

jillesvangurp2y ago

anshumankmr2y ago

So assuming all that to be true, how can the likes of Turnitin claim to be an authority for AI writing detection. When I graduated a few years back, they used to offer plag check only.

ShamelessC2y ago

> how can the likes of Turnitin claim to be an authority for AI writing detection

Pretty easy - they lie to people.

kylecordes2y ago

There is inherent conflict in having both an AI tool business and an AI tool detection business.

If the first does a good job, the second fails. And vice versa.

(On the other hand, maybe there is a lot of money to be made selling both, to different groups?)

sebzim45002y ago

I don't think this follows. If they wanted, they could crypographically bias the sampling to make the output detectable without decreasing capabilities at all.

Only people using it deceptively would be affected. No idea what portion of ChatGPT's users that is, would be very interested to know.

zarzavat2y ago

There’s a much more effective way: store hashes of each output paragraph (with a minimum entropy) that has ever been generated, and allow people to enter a block of text to search the database.

It wouldn’t beat determined users but it would at least catch the unaware.

1 more reply

wouldbecouldbe2y ago

mercurialsolo2y ago

For educators looking at evaluating students, essays and the like - we possibly need different ways of evaluation rather than on written asynchronous content for communicating concepts and ideas.

pixl972y ago

>is there a real utility to knowing this?

For civics, I would say yes.

Virtual beings are a great addition to the bot nets that generate false consensus.

klabb32y ago

I believe you’re exactly right. It would be similar to detecting that math homework used wolfram alpha or even a calculator.

bcherny2y ago

Timing seems related to Altman launching Worldcoin (whose goal is to reliably differentiate between human and AI generated content).

https://www.reuters.com/technology/openais-sam-altman-launch...

hospitalJail2y ago

Not to say "I can detect chatgpt" but it sure seems to have a similar way of talking even when I say things like: Talk like a "Millennial male who is obsessed with Zelda, their name is bob zelenski"

Now the topic isnt about anything millennial or Zelda related, but I'd think that the language model would select sentence and paragraph phrasing differently.

Maybe I need to switch to the API.

post-it2y ago

I've also noticed that ChatGPT tends to respond to short prompts, especially questions, in a predictable format. There are a few characteristics.

First, it tends to print a five-paragraph essay, with an introduction, three main points, and a conclusion.

Second, it signposts really well. Each of the body paragraphs is marked with either a bullet point or a number or something else that says "I'm starting a new point."

Third, it always reads like a WikiHow article. There's never any subtle humour or self-deprecation or ironic understatement. It's very straightforward, like an infographic.

SquareWheel2y ago

I have to admit I'm struggling to tell if this was done ironically, but your comment is exactly a five paragraph essay with an introduction, three main points, and a conclusion.

If so, nice meta-commentary.

1 more reply

lacker2y ago

Smart of OpenAI to shut down a tool that basically doesn't work before the school year starts and students start to get in trouble based on it.

https://www.slowboring.com/p/chatgpt-goes-to-harvard

woeirua2y ago

Nah, everything is just going to be proctored exams on paper in the future. Sucks for the pro take-home project crowd, but they ruined it for themselves.

PeterStuer2y ago

It's an open ended red-queen problem. You can't win.

Besides, even if they did win, they would still lose by shooting their own foot.

bilater2y ago

system22y ago

Fascinating. I tried zerogpt.com for testing a text I generated. Got 91% AI generated response. Used gptminus1.com and now zerogpt.com shows 0% AI. These detectors are garbage.

Roark662y ago

It is important humans learn to express themselves in writing. The only way I think this will happen is if kids do their writing at school supervised.

squarefoot2y ago

jerf2y ago

Text is a very high dimensional space, n-dimensional in fact. There is plenty of room for an AI to leave a fingerprint that can be detected in some ways but not others.

rhyme-boss2y ago

bloppe2y ago

ShamelessC2y ago

Although it can be used to write in other styles, sometimes it will refuse to because of this.

bloppe2y ago

You misunderstand me. I'm not talking about detecting when LLM's plagiarize humans, but when humans "plagiarize" (copy from without attributing) LLM's.

torginus2y ago

Grimblewald2y ago

that's the problem - no you won't. The problem is you've bought too much into the stochastic parrot view of LLM's to a point where you are viewing it as a fancy content storage/retrieval system.

kmeisthax2y ago

The idea that OpenAI was intentionally watermarking its output to avoid training data back-contamination should be thoroughly discredited now.

judcnegri2y ago

I think educators got away for too long of handling the problem of umotivated students and uninteresting classes subjects...

Not the educators fault tough, more like the system is bad.

My point is that given knowledge is mostly free and available, the system should teach the students to think rather than using tools or remembering facts

al_be_back2y ago

low accuracy is certainly a good reason to drop a project, especially when dealing with small text (<1000 chars), this is where most social media post/mini-blogs fall under.

bigger text e.g. reports, thesis etc are probably easier & cheaper to verify by humans, with help of A.I. tools (ref checking, searching...)

thepaulthomson2y ago

ShamelessC2y ago

No disrespect, but...is this comment AI generated?

vorticalbox2y ago

Could we not add invisible characters into the text a bit like a water mark?

nomel2y ago

Yes, and we could just as easily remove them.

vorticalbox2y ago

True but I feel it would catch a few people out?

tester4572y ago

We could but doing so would lobotomize it.

seeknotfind2y ago

Watermarking is a more tractable approach, but the cat is out of the bag.

sydon2y ago

How would you go about watermarking AI written text?

thewataccount2y ago

https://arxiv.org/pdf/2301.10226.pdf

Here's a decent paper on it.

OpenAI could already be doing this, and they could be watermarking with your account ID if they wanted to.

The current best countermeasure is likely paraphrasing attacks https://arxiv.org/pdf/2303.11156.pdf

doctorpangloss2y ago

I don't know.

I suppose hosted solutions like ChatGPT could offer an API where you copy some text in, and it searches its history of generated content to see if anything matches.

> bUt aCtuAlLy...

There will probably be a lot of "services" that "just" "tell you" if "it" is "written by an AI."

cateye2y ago

Watermarking needs to be subtle enough to be unnoticeable to opposing parties, yet distinctive enough to be detectable.

So, this is an arms race especially because detecting it and altering it based on the watermark is also fun :)

1 more reply

brucethemoose22y ago

Make half of the tokens (the AI's "dictionary") slightly more likely.

This would not impact output quality much, but it would only work for longish outputs. And the token probability "key" could probsbly be reverse engineered with enough output.

1 more reply

taneq2y ago

type=text/chatgpt :P

merlincorey2y ago

Invisible characters in a specific bit-pattern.

Pretty common steganographic technique, really.

2 more replies

mepian2y ago

If it's generated by a SaaS, the service could sign all output with a public key.

3 more replies

cwkoss2y ago

only tractable for closed source hosted LLMs

stormed2y ago

Interesting. I was under the impression this tool was effective because of some sort of hidden patterns generated in sentences. I guess my assumption was way complex than what it actually is

islammidov2y ago

It's very interesting actually. I think from business perspective SEO is the most exposed. Very curious how Google will (or may be now) solve the issue?

atleastoptimal2y ago

Even if AI detectors were 99% effective, anyone could just iterate over an AI produced piece of writing until it's in the 1% that isn't detected and submit it.

janalsncm2y ago

On a related note, I built this site as a PSA:

https://isthiswrittenbyai.surge.sh

partiallypro2y ago

If we can't tell what is written by AI, how will we know if AI is just stuck in a feedback loop in which it references itself (and is wrong)?

cutler2y ago

ChatGTP2y ago

We're entering a wild world. Our information space is up for the slaughter.

nektro2y ago

it was doomed to fail from the start. being detectable is precisely how the bot is trained. so the moment a tool figures out a classifier, a new generation that undetectable springs up.

hayd2y ago

Was this what Stack Overflow were using to detect automated answers?

Ukv2y ago

BigElephant2y ago

No official announcement is lame

system22y ago

No lamer than dropping davinci.

j / k navigate · click thread line to collapse