Question answering and learning are just a corner of LLM usage, but they have learning signals for the AI. Say a user asks about Pythagoras, the LLM provides an explanation, the user doesn't get it. The LLM tries again.
Repeat this loop a million times with diverse students and you get a distribution of what kind of explanations work. The model gets better at explaining through its own experience.