Perhaps the AI can observe a human playing the game and learn a reward function?
Learning from humans: what is inverse reinforcement learning? https://thegradient.pub/learning-from-humans-what-is-inverse...
Much of the difficulty of programming (for someone else) is due to the same thing.
But this seems like at best one of a whole host unexpected effects one might consider. AI that discriminates in a way that society frowns on might not "disrupt the world" in such a visible fashion.
I don't see how one can get away with an entity doing stuff for you with that entity understanding your model of the world.