undefined | Better HN

0 pointsgaryfirestorm4mo ago0 comments

> LLMs would have no hope at conceptualizing any of that.

Counter argument - generating probabilistic tokens (degree of randomness) is core concept for an LLM.

0 comments

mrob4mo ago

It's not. The LLM itself only calculates the probabilities of the next token. Assuming no race conditions in the implementation, this is completely deterministic. The popular LLM inference engine llama.cpp is deterministic. It's the job of the sampler to actually select a token using those probabilities. It can introduce pseudo-randomness if configured to, and in most cases it is configured that way, but there's no requirement to do so, e.g. it could instead always pick the most probable token.

nostrebored4mo ago

This is a poor conceptualization of how LLMs work. No implementations of models you’re talking to today are just raw autorrgressive predictors, taking the most likely next token. Most are presented with a variety of potential options and choose from the most likely set. A repeated hand and flop would not be played exactly the same in many cases (but a 27o would have a higher likelihood of being played the same way).

mrob4mo ago

>No implementations of models you’re talking to today are just raw autorrgressive predictors, taking the most likely next token.

Set the temperature to zero and that's exactly what you get. The point is the randomness is something applied externally, not a "core concept" for the LLM.

2 more replies

j / k navigate · click thread line to collapse

0 comments

mrob4mo ago

nostrebored4mo ago

mrob4mo ago

>No implementations of models you’re talking to today are just raw autorrgressive predictors, taking the most likely next token.

Set the temperature to zero and that's exactly what you get. The point is the randomness is something applied externally, not a "core concept" for the LLM.

2 more replies

j / k navigate · click thread line to collapse