Who decided that they should have the power to rule over this? Not only does dataset compilation appear to be legal, but declaring it inherently immoral and consent-violating would be a draconian expansion of IP law that almost no one should agree to. If I do statistical research and count the most common English words across many books, should I be obligated to ask every author whether they're okay with me analyzing their works? If I look at a thousand character designs and pick up some cues and ideas to create my own, should I pay a licensing fee?