However, me personally, I don't want to be lured into some web store when I'm looking for some vaguely related information. Luckily, there's tons of information on the web provided not by commercial entities but by volunteers: wikipedia, forum users (e.g. StackOverflow), blogs. (Sure, some people run blogs as a source of income, but I think that's a small percentage of all bloggers.)
Have you ever looked for a specific recipe just to end up on someone's cooking website where they first tell your their life story before - after scrolling for a half a day - you'll finally find what you've actually come there for (the recipe!) at the bottom of their page? Well, if that was gone, I'd say good riddance!
"But you don't get it", you might interject, "it's not that the boilerplate will disappear in the future, the whole goddamn blog page will disappear, including the recipe you're looking for." Yeah, I get it, sure. But I also have an answer for that: "oh, well" (ymmv).
My point is, I don't mind if less commercial stuff is going to be sustainable in a future version of the web. I'm old enough to have experience the geocities version of the early web that consisted of enthusiasts being online not for commercial interests but for fun. It was less polished and less professional, for sure, but less interesting? I don't think so.
If new people don't discover your site with useful user-created content, they won't contribute to it. You're also cutting off the pipeline for recruiting new users to your forum or Q&A site.
This trend was happening before LLMs entered the arena.
This is because they're a big bunch of assholes and no one wants to deal with that. Their decline started way before ChatGPT came in.
The people who made SO are not going anywhere, there will always be a SO, a wikipedia, a search engine. Let it evolve to the next thing.
But that's just like, my opinion, dude.
...which might be beneficial. A problem they'd been trying to deal with for over a decade was the massive influx of low quality duplicate and "do my homework for me" questions from people who don't even bother looking for a solution. If they've all moved off to AI things, problem solved and maybe SO can return to its high-quality origins?
On the other, I think it's unlikely the fun old geocities era comes back.
We'll probably get stuff that looks like it, but it's hawking nationalist revisionist propaganda instead of occult shapeshifting magic lifted from Sabine Baring-Gould, and a thousand Temple OS-inspired clones instead of python.
People will have less or no motivation to create them, because well, why would they? It will be just a food for AI of some corporation.
And more importantly, people won't be finding and joining communities that produce the websites like stack overflow.
It was nice while it lasted, but likely it will be something that existed only for one generation.
Not sure if we surf the same internets... In the web I am surfing, the more "motivation" (trying to get ad revenue) the author has, the crappier the content is. If I want to find high quality information, invariably I am seeking authors with no "motivation" whatsoever to produce the content (wikipedia, hacker news, reddit with a heavy filter etc.) I'm pretty sure we would be better off if the whole ad industry vanished.
Food that said corporation makes a profit off while paying the author nothing.
Not saying this is good, but it’s the reason behind it.
LLMs are making many of the enthusiasts who were online just "for fun" feel sick for contributing to their training set.
> someone's cooking website where they first tell your their life story before
I haven't seen a recipe page without a "Jump to Recipe" page button in forever.
In the short term it will feel liberating; in the long term it will kill the web.
They won't feel so liberated when they find ads embedded in the model's response in ways that make it difficult to uBlock.
And that's assuming a world where people only ask LLMs and don't care about the provenance or trustworthiness of information at all. Which seems unlikely, even conspiracy nuts have some sources they trust more than others. The web will be fine. It will change drastically with the death of click-based advertising, but I don't see a future where it disappears
They will just have one model that they trust more because (a) it aligns with their views, or (b) it's a sycophant and agrees with anything and everything.
They definitely are not the people most likely to care about clicking "source links".
But unfortunately also sites that generate high quality information (eg independent research, reviews, journalism) will also struggle and be more reliant on paywalls and subscriptions.
But great content creators will want to be rewarded one way or another. Book writers get paid, movie makers as well, so why shouldn't those who share their wisdom in a blog post? If someone is making money off your content and you aren't, you will not be happy. When it's not even attributed to you, it's even worse.
SEO is a response to Google's incentives, and Google can fix it if it wants.
The question is: is there content which is useful, but not provided by volunteers? We see more and more content behind paywalls, and it is a loss for many people who can't pay, because they won't be able to access the same content for free supported by ads.
So the result is poor people are going to lose access to certain contents, while well to do people will still have access.
> many people who can't pay
Everybody is already paying for Spotify and for Netflix. They can pay for mass syndication of textual content. But it needs to be like Spotify or YouTube, where everything and anything goes. Poor people always had access to read newspapers.
Lots of people might be willing to run websites for fun or personal satisfaction or whatever, but how many people will continue to be willing to do so when they don't actually get to present the content to visitors and it's instead just regurgitated by AI? Half the fun of hosting your own website is personalizing it and choosing how to share the content. Even people blogging for fun put a lot of thought into their posts on how to phrase an argument or tell a story. But what's the point when nobody will ever see your actual post, just your thoughts rearranged and presented by AI? Maybe some people only care about the information being out there in any form, but I'd be willing to bet that's yet a smaller subset of even the people who would contribute in a return to geocities version of the web.
We might see a resurgence in curated content but I have my worries. Google gets worse and worse but also traditional curated sites have started simply repost what's trending on Twitter.
People want signal and answers, not the 10 blue links as this post tries to argue.
The other thing is this: most high quality and valuable content can now be produced by individuals and finds distribution on social networks where they can occasionally charge for it as well. The drawback to google indexing those links was also that SEO-companies started targeting these mediums (eg: reddit, medium, forums). We needed an early regulation to minimize the needless hacking of SEO, but we let the market play it out, so it should still play out.
AI is killing the web – can anything save it? - https://news.ycombinator.com/item?id=44623361 - July 2025 (448 comments)
Other related threads?
It was a very short lunch.
I think I'm going to miss the world where I could more easily trace the provenance of information.
Mostly useful when you're only looking for the presence of words or terms in the output (including the presence of related words), rather than a coherent explanation with the quality of human-written text.
Sometimes the response is accidentally a truthful statement if interpreted as human text. The quality of a model is judged by how well-tuned they are for increasing the rate these accidents (for the lack of a better word).
[1]: EDIT: In the sense of "semantic web"; not in the sense of "actually understanding meaning" or any type of psychological sense.
I get links in my responses from Gemini. I would also not describe the response as soup, the answers are often quite specific and in the terms of my prompt instead of the inputs (developer queries are a prime example)
I believe there's strong overlap between technically minded people and ad blockers. Maybe the challenge is that AI search appeals to less technically-minded people, who would have otherwise been exposed to ads?
With that said there are some queries AI does better than search engines. If I have something I'm trying to remember from the distant past with poorly refined definitions I can iterate to a solution much better than reading spam filled sites.
Repeat this loop a million times with diverse students and you get a distribution of what kind of explanations work. The model gets better at explaining through its own experience.
When AI provides a response it is possible to judge that response in hindsight. You look at the next 20 messages or sessions from next days and judge based on what followed. The chat logs provide a way to do long range credit assignment.
The modern abuse of copyrights by the likes of Disney does not negate this otherwise wonderful institution.
Is it? We don't have the technology to duplicate the Earth in, say, 1776 but without copyright, and run an experiment, so all we can point at is a logic argument that we need to incentivize writer and artists and creators. Which I mean, sure. I want to write the next great American novel and not have to work for the rest of my life. and for my children and their children to not have to work either. Is that really for the betterment of society though? You can give some additional logic arguments in favor of that, but without Earth duplication technology, there's nothing that really constitutues real actual proof. The closest comparison we have that I know of is to look at China, which has far weaker intellectual property laws, and, well, they haven't fallen into lawless anarchy.
"Many widely used machine-learning models rely on copyrighted data. For instance, Google finds the most relevant web pages for a search term by relying on a machine learning model trained on copyrighted web data. But the use of copyrighted data by machine learning models that generate content (or give answers to search queries than link to sites with the answers) poses new (reasonable) questions about fair use. By not sharing the proceeds, such systems also kill the incentives to produce original content on which they rely. For instance, if we don’t incentivize content producers, e.g., people who respond to Stack Overflow questions, the ability of these models to answer questions in new areas is likely to be lower. The concern about fair use can be addressed by training on data from content producers who have opted to share their data. The second problem is more challenging. How do you build a system that shares proceeds with content producers?"
https://www.gojiberries.io/generative-ai-and-the-market-for-...
There's a simple solution. People that publish things can put up a paywall and people can pay what the content is worth.
The thing that AI endangers is not valuable content, it's the SEO clickbait cashcow, and as far as I'm concerned, the faster AI kills that off, the better.
That monetization model is corrupt as hell, produces all sorts of perverse incentives, and is the epitome of the enshittification of the web.
Burn, baby, burn.
Valuable content is endangered because writers feel demotivated it their material is just stolen by overfunded big corporations.
Paywalls only work for known publications and not for someone who writes the perfect tutorial on how to solve boot issues in Debian. Why would anyone write that if it's just stolen and monetized without attribution?
Note this is a personal blog without any seo incentives.
There is a renaissance of extremely high quality YouTube videos from creators with very few subscribers. Particularly in Math and Science content.
But the general web is full of a lot really bad websites that effectively just waste people's time.
Here's the flashpoint, there is no chemical flashpoint of sodium bromide closed-jar or not, just plasma flashpoint. So safe, NextGen Midland can supply all your chemical needs: click here for more info <insert hyper-hyper-grabby-link>
but i wouldn't mind getting back to the internet of the 80/90s where you could easily find more genuine content and less aggregators, replicators, marketeers and clickfarms. if that's "killing the internet" then it couldn't happen soon enough (i guess marketeers will not go away no matter what, that's a given).
the fear of decline of original content doesn't seem serious. much of what there is now is endless regurgitation anyway. while most of the free stuff nowadays is indeed just noise, the most valuable, original and quality stuff is free, has always been, and it's there. people have been contributing interesting stuff for multiple reasons and in multiple ways for decades, and still do; it is just buried under tons of rubbish. i see no reason why they would stop. if anything, a less noisy internet could be an incentive, and if gaps in knowledge form that will be even more reason to share and contribute, and things like stackoverflow will come back once llms become obsolete enough.
If this book came out today, in 2025, how would you know that the 420 pages are actually worth your time and not just a bunch of hallucinated LLM slop?
I've been wondering whether Wikipedia and libraries in 2030 will be in a better overall place quality-wise, or will just be overrun.
The last few times I looked for information on YouTube (by typing a keyword phrase or question instead of looking up a specific channel/creator), most of the top results were AI-narrated presentations. Some of those were filled with comments of people correcting obvious mistakes in the content (which as a layperson I would not have seen as mistakes)
Good riddance.
We can be forgiven for not seeing how social media was going to become weaponized against us, how streaming's promise of no ads was only temporary. There's no excuse for not seeing it coming this time.
Maybe we will just go back to pay content as it was before the Internet era? Magazines and such
I enjoy reading some very niche magazines and, at the moment, I cannot tell if it's AI generated or not. So, as long as the reader is entertained, magazines will do just fine.
Either that, or we will develop some sort of proof that the content is human generated, not AI.
Every single time I open up google and try searching for the information... I get frustrated being forced so do the agentic work and sift through the crap... and I fall back on ChatGPT or Gemini.
If LLMs ruin the economic viability of corporate blogspam, that's a net positive for society in my eyes. One of the few net positives we can expect from the AI bubble, as far as I can tell.
Of course, the new problem is that we have a bunch of LLMs trained on corporate blogspam, producing low-quality information that only feels plausible because of its correct grammar and neutral voice.
What would be the mechanism for that? If anything LLMs make SEO spam almost free to make.
you do not have to be too old to remember how the internet got on perfectly fine without this model (and in my view, largely better)
Link is in my profile
This is what killed the web - ads.
The web is unusable without ad blockers.
> The article focuses mainly on the publishing industry, news and magazine sites that rely primarily on visits to their sites and selling ads.
I'm not sure where this comes from. The way forward for publishers of content like newspapers is subscription fees and has been for a long time.
The New York Times, The Wall Street Journal, and The Economist revenues are subscription fee dominant, for example.
You can find examples of national papers of record having successful subscription models for text content. If you're only subscribing to one paper, it'll be one of those. And you can find 1-5 person outfits with strong personal brands (often built in other media) and a loyal following, who specially want a given person's take on things and want to read everything they write. Basically financials that resemble the "1000 true fans" model.
But between those extremes it is a lot harder to make subscriptions work.
https://www.pewresearch.org/journalism/fact-sheet/social-med...
Folks who want more traditional journalism will pay for it.
When it comes to “I have a specific question I need answering and then I’m done” the Web feels horribly clumsy and full of absolute garbage to wade through because they don’t want you to get the answer and go away. They want to milk your eyeballs for impressions and attention.
I prefer a million times today's web, which serves everybody and where I can find all kind of thoughts and ideas, without restriction. You just need to make an effort to find it. I prefer a million times a well stocked supermarket with all the ingredients I need to make anything I want, rather than a restaurant which serves only one meal made perfectly.
Content can’t be free if you want it to be of any quality.
Likewise, a lot of content produced with commercial interest in mind is total garbage (this is e.g. where the term "click-bait" originates from).
There's always quality stuff and crap, no matter whether it's been produced for free or not.
There are lots smaller local websites which can produce useful local content because of ad support. Those may not have enough subscribers to continue behind a paywall.
The big channels nowadays usually have 2 websites: one that is free and full of ads and pop-ups with very superficial news (seemingly written by interns) and one with actual quality analysis, journalism etc. that allow you access of 3 articles a month before you need to pay or something of that sorts.
I think the “serving ads” business hasn’t worked for a while.
This hasn't been the case for more than a decade.
It's been seo crap from a long time.
.. mainly that’s because that’s the only game left
Money is not the only motivation to create.
for 20 years, websites that released idol nudes dominated (https://www.change.org/p/allkpop-com-shut-down-allkpop )
then the first wave of slop that figured out you can buy links (https://x.com/paulkimio/status/1550532282288508929)
and now for the past three and a half years it's been indian news websites that dominate
good riddance to the search-powered internet and seoers, i don't mind bleeding-- i've been bleeding this whole time :)