Tech likes to follow the “ask for forgiveness, not for permission “ motto.
If OpenAI, Facebook, or whoever asked for permission to gobble up all publicly visible data to train a program to output statistically similar data, I don’t believe they would’ve got the permission.
In that sense, I don’t think these models should’ve been made.
I dont think any of those companies would be in the clear. That’s my point.
AI is a copyright black hole, albeit a useful one.