Why not? Given enough data, it's possible to train models to differentiate - especially since humans can pick up on the difference pretty well.
> Plus some users might want to legitimately upload things with AI-generated content in it
Excluding videos from training datasets doesn't mean excluding them from Youtube.