But LLMs achieve both your condition and mine. The attention layers make the causal connections you speak of, while the multi-layer perceptrons store and extract facts in response to that attention-weighted mix.
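For concreteness, here is a minimal sketch of a single decoder block in plain PyTorch, with made-up sizes (d_model=64, n_heads=4), not any specific model's implementation. The attention sublayer mixes information across token positions; the MLP sublayer transforms each position independently, which is where much of the stored knowledge is thought to live.

```python
import torch
import torch.nn as nn

class TransformerBlock(nn.Module):
    def __init__(self, d_model=64, n_heads=4):
        super().__init__()
        # Attention mixes information *across* token positions:
        # the "causal connections" between tokens in context.
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ln1 = nn.LayerNorm(d_model)
        # The MLP acts on each position independently, transforming
        # whatever the attention step has gathered there.
        self.mlp = nn.Sequential(
            nn.Linear(d_model, 4 * d_model),
            nn.GELU(),
            nn.Linear(4 * d_model, d_model),
        )
        self.ln2 = nn.LayerNorm(d_model)

    def forward(self, x):
        # Causal mask: each token attends only to earlier positions.
        T = x.size(1)
        mask = torch.triu(torch.ones(T, T, dtype=torch.bool), diagonal=1)
        a, _ = self.attn(x, x, x, attn_mask=mask, need_weights=False)
        x = self.ln1(x + a)            # residual + norm around attention
        x = self.ln2(x + self.mlp(x))  # residual + norm around the MLP
        return x

# Usage: a batch of 2 sequences, 8 tokens, 64-dim embeddings.
x = torch.randn(2, 8, 64)
print(TransformerBlock()(x).shape)  # torch.Size([2, 8, 64])
```

A GPT-style model is just dozens of these blocks stacked, so the "connect" and "store/extract" steps alternate many times per token.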
It is not commonly described as such, but I think “common sense engine” is a far better description of what a GPT-based LLM is doing than mere next-word prediction.