Selection bias is a given. You always have to keep that in mind. But when you actually want to read a specific article, summarizers are useful. For news and general-audience content, debullshitifiers could come in handy too.
Point being, the texts are not random. There's some nugget of valuable content in them, but it's usually wrapped in an enormous layer of SEO, ad hooks, word count padding, and/or general nonsense. Improving the signal-to-noise ratio here - stripping all those layers of bullshit - is strictly useful.