Things only seem different in the LLM when we ask the same question because we dont use the same random seed each time.