> Can you tag for me all photos in my album that contain a kitten, but not a dog, with "kitten", and those that contain a kitten and a dog, with "pets<3"?
This approach is backwards. This is the kind of problem that is easy for a person but hard for an AI, so if M were an AI pretending to be a human, you could use it to expose that. But in this case the suspicion is that M is a human pretending to be an AI. A human could simply decline to attempt the task, pretend to be unable, or deliberately do a bad job, so a negative result would tell you nothing.