undefined | Better HN

0 pointsfuryofantares1y ago0 comments

And if you tell it to think up a word that has an E in position 3 and an L that's somewhere in the word but not in position 2, it's not going to be any better at that if you tell it to answer one letter at a time.

0 comments

layer81y ago

The idea is, instead of five-letter-words, play the game with five-token-words.

furyofantaresOP1y ago

That was my original interpretation, and while all it sees are tokens, roughly none of its training data is metadata about tokenizing. It knows far less about the positions of tokens in words than it does about the positions of letters in words.

layer81y ago

I’m not sure that training data about that would be required. Shouldn’t the model be able to recognize that `["re", "cogn", "ize"]` represents the same sequence of tokens as `recognize`, assuming those are tokens in the model?

More generally, would you say that LLMs are generally unable to reason about sequences of items (not necessarily tokens) and compare them to some definition of “valid” sequences that would arise from the training corpus?

1 more reply

j / k navigate · click thread line to collapse

0 pointsfuryofantares1y ago0 comments

0 comments

layer81y ago

The idea is, instead of five-letter-words, play the game with five-token-words.

furyofantaresOP1y ago

layer81y ago

1 more reply

j / k navigate · click thread line to collapse