But when all those are harvested to cause mass redundances there's a difference. But, at least with those two cases above, essentially reference material and technical answers, I doubt there's a copyright problem.
What could regulators even do? Force the model owners to regurgitate source material references? Force the model owners to pay a small fee pemr post used? Difficult when most of the material is freely available online with ads.
Besides a lot of the material will come from third parties who have scraped the posts. A whole branch of regulation could appear to try to track the source material which has been put online for free. Or even scraped and then put online again sans copyright to be rescraped without the need to pay anyone.
Maybe we can ask ChatGPT to search its databases for its own sources to its own answers. And even check the source is the original. Problem solved I guess... Feed the overbrain and it'll throw you some pennies while it's used to do you out of a job.