They say they scraped the open web - so for example this would include many of our personal sites, many of which have profile pictures.
For myself, I took the picture on my site, and it's under a: Attribution, NonCommercial, NoDerivatives CC license. I'd argue that
1. Using my/anyone's profile picture in an AI system for profit is commercial use.
2. A neural network is a derivative work of all images used to train that network.