I would counter that they were operating under the direction of the corporation and, as such, the corporation should be liable. But, I'm not a lawyer.
My overall point is just that a corporation willfully violating copyright should be treated wholly differently than an individual.
I still think that, regardless of any copyright violation by the corporation in acquiring the content, that training with the data doesn't (or shouldn't) somehow taint the resulting model.