Some of the "AI" startups that mix automated intelligence with human fallback have probably got it much more right: Sometimes, you need people.
Why is the action to flag and penalize the site? Why would the action not be "google stops showing that ad"?
I don't find this kind of result surprising at all, particularly given how big Google is. If the site safety team is different from the don't-show-evil-ads team, it's almost an inevitable result, at least, in some point in the evolution of the system(s) and processes involved. It does point out some improvements that are needed.