>No implementations of models you’re talking to today are just raw autoregressive predictors, taking the most likely next token.
Set the temperature to zero and that's exactly what you get. The point is that the randomness is applied externally, at the sampling step; it is not a "core concept" for the LLM.
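To make that concrete, here is a rough sketch of a sampling step that sits outside the model. The function name `sample_next_token` and the logits are made up for illustration, and real inference stacks add more (top-k, top-p, and so on); the assumption is only that the model hands back a list of scores and the sampler decides what to do with them.

```python
import math
import random

def sample_next_token(logits, temperature, rng=random):
    """Pick a token index from raw logits (hypothetical helper, not a real library API)."""
    if temperature == 0:
        # Greedy decoding: deterministic argmax, no randomness involved.
        return max(range(len(logits)), key=lambda i: logits[i])
    # Scale the logits by temperature, then apply a numerically stable softmax.
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    weights = [math.exp(s - m) for s in scaled]
    # The only random step: draw an index in proportion to the weights.
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]

# Made-up logits for a three-token vocabulary.
logits = [1.0, 3.0, 2.0]
greedy = sample_next_token(logits, temperature=0)
sampled = sample_next_token(logits, temperature=1.0, rng=random.Random(0))
```

The model's output (`logits`) is identical in both calls; only the sampler changes. At temperature 0 the result is always the highest-scoring token.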