The PushShift datasets were very useful for giving demos on how big data can be analyzed with more intuitive and interesting results. Here's an older blog post of mine analyzing Reddit data from 2018: https://minimaxir.com/2018/09/modeling-link-aggregators/
I would not be in data science/machine learning now if it weren't for Reddit data.
I don't think that was ever a secret. If you say "I don't want AI to use my data", there's a pretty obvious "for free" implied in there.
This is probably a good move for the Reddit owners to get some revenue for a site rapidly becoming as relevant as Slashdot.
Like, if AI video and art are controversial on there, AI use of users' content is going to be even more controversial to say the least. Perhaps enough to cause another protest or outrage to break out.
This feels like the silliest decision Reddit corporate could make with an IPO.
2. ????
3. $5B IPO price target
4. Profit