I'm surprised people are surprised.
>> That entity will scrape the internet and train the models and claim that "it's just research" to be able to claim that all is fair-use.
a lot of people and entities do this though... openAI is in the spotlight, but scraping everything and selling it is the business model for a lot of companies...