The image generators accept a prompt, not a question, so we don't expect an answer.
ChatGPT generates responses to prompts that make it sound like it is answering a question that is posed.
Ask ChatGPT a question. The mere fact that I can fairly reasonably pose that as "asking it a question" is why people get confused. It's very easy to interact with it conversationally, and weird to interact with it otherwise because the text it generates is conversational. A generated image is never a conversation, so of course it isn't an answer and so can't be wrong.