The Case That A.I. Is Thinking

1011 comments

Having seen LLMs so many times produce coherent, sensible and valid chains of reasoning to diagnose issues and bugs in software I work on, I am at this point in absolutely no doubt that they are thinking.

Consciousness or self awareness is of course a different question, and ones whose answer seems less clear right now.

Knee jerk dismissing the evidence in front of your eyes because you find it unbelievable that we can achieve true reasoning via scaled matrix multiplication is understandable, but also betrays a lack of imagination and flexibility of thought. The world is full of bizarre wonders and this is just one more to add to the list.

keiferski6mo ago

I don’t see how being critical of this is a knee jerk response.

Thinking, like intelligence and many other words designating complex things, isn’t a simple topic. The word and concept developed in a world where it referred to human beings, and in a lesser sense, to animals.

To simply disregard that entire conceptual history and say, “well it’s doing a thing that looks like thinking, ergo it’s thinking” is the lazy move. What’s really needed is an analysis of what thinking actually means, as a word. Unfortunately everyone is loathe to argue about definitions, even when that is fundamentally what this is all about.

Until that conceptual clarification happens, you can expect endless messy debates with no real resolution.

“For every complex problem there is an answer that is clear, simple, and wrong.” - H. L. Mencken

13 more replies

layer86mo ago

Sometimes after a night’s sleep, we wake up with an insight on a topic or a solution to a problem we encountered the day before. Did we “think” in our sleep to come up with the insight or solution? For all we know, it’s an unconscious process. Would we call it “thinking”?

The term “thinking” is rather ill-defined, too bound to how we perceive our own wakeful thinking.

When conversing with LLMs, I never get the feeling that they have a solid grasp on the conversation. When you dig into topics, there is always a little too much vagueness, a slight but clear lack of coherence, continuity and awareness, a prevalence of cookie-cutter verbiage. It feels like a mind that isn’t fully “there” — and maybe not at all.

I would agree that LLMs reason (well, the reasoning models). But “thinking”? I don’t know. There is something missing.

AnIrishDuck6mo ago

> Sometimes after a night’s sleep, we wake up with an insight on a topic or a solution to a problem we encountered the day before.

The current crop of models do not "sleep" in any way. The associated limitations on long term task adaptation are obvious barriers to their general utility.

> When conversing with LLMs, I never get the feeling that they have a solid grasp on the conversation. When you dig into topics, there is always a little too much vagueness, a slight but clear lack of coherence, continuity and awareness, a prevalence of cookie-cutter verbiage. It feels like a mind that isn’t fully “there” — and maybe not at all.

One of the key functions of REM sleep seems to be the ability to generalize concepts and make connections between "distant" ideas in latent space [1].

I would argue that the current crop of LLMs are overfit on recall ability, particularly on their training corpus. The inherent trade-off is that they are underfit on "conceptual" intelligence. The ability to make connections between these ideas.

As a result, you often get "thinking shaped objects", to paraphrase Janelle Shane [2]. It does feel like the primordial ooze of intelligence, but it is clear we still have several transformer-shaped breakthroughs before actual (human comparable) intelligence.

1. https://en.wikipedia.org/wiki/Why_We_Sleep 2. https://www.aiweirdness.com/

popalchemist6mo ago

There is simply put no ongoing process and no feedback loop. The model does not learn. The cognition ends when the inference cycle ends. It's not thinking, it just produces output that looks similar to the output of thinking. But the process by which it does that is wholly unreleated.

3 more replies

creer6mo ago

Interesting, you think the associations your brain comes up with during sleep are NOT thinking?

2 more replies

madaxe_again6mo ago

Perhaps this is an artefact of instantiation - when you talk with an LLM, the responding instance is just that - it comes into being, inhales your entire chat history, and then continues like the last chap, finishes its response, and dies.

The continuity is currently an illusion.

bithead6mo ago

Do LLMs ever ask for you to clarify something you said in a way a person who doesn't quite understand what you said will do?

4 more replies

lordnacho6mo ago

Much like speaking to a less experienced colleague, no?

They say things that contain the right ideas, but arrange it unconvincingly. Still useful to have though.

petralithic6mo ago

> Would we call it “thinking”?

Yes I would.

notepad0x906mo ago

I don't get why you would say that. it's just auto-completing. It cannot reason. It won't solve an original problem for which it has no prior context to "complete" an approximated solution with. you can give it more context and more data,but you're just helping it complete better. it does not derive an original state machine or algorithm to solve problems for which there are no obvious solutions. it instead approximates a guess (hallucination).

Consciousness and self-awareness are a distraction.

Consider that for the exact same prompt and instructions, small variations in wording or spelling change its output significantly. If it thought and reasoned, it would know to ignore those and focus on the variables and input at hand to produce deterministic and consistent output. However, it only computes in terms of tokens, so when a token changes, the probability of what a correct response would look like changes, so it adapts.

It does not actually add 1+2 when you ask it to do so. it does not distinguish 1 from 2 as discrete units in an addition operation. but it uses descriptions of the operation to approximate a result. and even for something so simple, some phrasings and wordings might not result in 3 as a result.

slightwinder6mo ago

> It won't solve an original problem for which it has no prior context to "complete" an approximated solution with.

Neither can humans. We also just brute force "autocompletion" with our learned knowledge and combine it to new parts, which we then add to our learned knowledge to deepen the process. We are just much, much better at this than AI, after some decades of training.

And I'm not saying that AI is fully there yet and has solved "thinking". IMHO it's more "pre-thinking" or proto-intelligence.. The picture is there, but the dots are not merging yet to form the real picture.

> It does not actually add 1+2 when you ask it to do so. it does not distinguish 1 from 2 as discrete units in an addition operation.

Neither can a toddler nor an animal. The level of ability is irrelevant for evaluating its foundation.

5 more replies

ako6mo ago

An LLM by itself is not thinking, just remembering and autocompleting. But if you add a feedback loop where it can use tools, investigate external files or processes, and then autocomplete on the results, you get to see something that is (close to) thinking. I've seen claude code debug things by adding print statements in the source and reasoning on the output, and then determining next steps. This feedback loop is what sets AI tools apart, they can all use the same LLM, but the quality of the feedback loop makes the difference.

3 more replies

lossyalgo6mo ago

Furthermore regarding reasoning, just ask any LLM how many "r letters are in strawberry" - repeat maybe 3 times just to get a feeling for how much variance in answers you can get. And this "quirk" of the inability to get the right answer is something that after 2 years making fun of LLMs online on various forums is still an issue. The models aren't getting smarter, and definitely aren't thinking, they are still token generators with a few tricks on top to make them seem more intelligent than predecessors.

2 more replies

IanCal6mo ago

> it's just auto-completing. It cannot reason

Auto completion just means predicting the next thing in a sequence. This does not preclude reasoning.

> I don't get why you would say that.

Because I see them solve real debugging problems talking through the impact of code changes or lines all the time to find non-obvious errors with ordering and timing conditions on code they’ve never seen before.

1 more reply

logicchains6mo ago

>I don't get why you would say that. it's just auto-completing. It cannot reason. It won't solve an original problem for which it has no prior context to "complete" an approximated solution with. you can give it more context and more data,but you're just helping it complete better. it does not derive an original state machine or algorithm to solve problems for which there are no obvious solutions. it instead approximates a guess (hallucination).

I bet you can't give an example such written problem that a human can easily solve but no LLM can.

xanderlewis6mo ago

> I don't get why you would say that.

Because it's hard to imagine the sheer volume of data it's been trained on.

1 more reply

madaxe_again6mo ago

The vast majority of human “thinking” is autocompletion.

Any thinking that happens with words is fundamentally no different to what LLMs do, and everything you say applies to human lexical reasoning.

One plus one equals two. Do you have a concept of one-ness, or two-ness, beyond symbolic assignment? Does a cashier possess number theory? Or are these just syntactical stochastic rules?

I think the problem here is the definition of “thinking”.

You can point to non-verbal models, like vision models - but again, these aren’t hugely different from how we parse non-lexical information.

2 more replies

Kichererbsen6mo ago

Sure. But neither do you. So are you really thinking or are you just autocompleting?

When was the last time you sat down and solved an original problem for which you had no prior context to "complete" an approximated solution with? When has that ever happened in human history? All the great invention-moment stories that come to mind seem to have exactly that going on in the background: Prior context being auto-completed in an Eureka! moment.

1 more reply

naasking6mo ago

> don't get why you would say that. it's just auto-completing.

https://en.wikipedia.org/wiki/Predictive_coding

> If it thought and reasoned, it would know to ignore those and focus on the variables and input at hand to produce deterministic and consistent output

You only do this because you were trained to do this, eg. to see symmetries and translations.

jiggawatts6mo ago

You wrote your comment one word at a time, with the next word depending on the previous words written.

You did not plan the entire thing, every word, ahead of time.

LLMs do the same thing, so... how is your intelligence any different?

2 more replies

simulator5g6mo ago

Having seen photocopiers so many times produce coherent, sensible, and valid chains of words on a page, I am at this point in absolutely no doubt that they are thinking.

slightwinder6mo ago

Photocopiers are the opposite of thinking. What goes in, goes out, no transformation or creating of new data at all. Any change is just an accident, or an artifact of the technical process.

3 more replies

efitz6mo ago

https://www.theregister.com/2013/08/06/xerox_copier_flaw_mea...

1 more reply

Zardoz846mo ago

I saw Dr. Abuse producing coherent, sensible and valid chains of words, running on a 386.

throwaway-00016mo ago

I’ve seen so many humans bring stupid. Definitively there is nothing in the brain.

You see how doesn’t make sense what you saying?

m_rpn6mo ago

easily top 10 best HN comment ever.

geon6mo ago

Having seen LLMs so many times produce incoherent, nonsensical and invalid chains of reasoning...

LLMs are little more than RNGs. They are the tea leaves and you read whatever you want into them.

rcxdude6mo ago

They are clearly getting to useful and meaningful results with at a rate significantly better than chance (for example, the fact that ChatGPT can play chess well even though it sometimes tries to make illegal moves shows that there is a lot more happening there than just picking moves uniformly at random). Demanding perfection here seems to be odd given that humans also can make bizarre errors in reasoning (of course, generally at a lower rate and in a distribution of kinds of errors we are more used to dealing with).

1 more reply

bongodongobob6mo ago

Ridiculous. I use it daily and get meaningful, quality results. Learn to use the tools.

5 more replies

burnte6mo ago

The first principle is that you must not fool yourself, and you are the easiest person to fool. - Richard P. Feynman

They're not thinking, we're just really good at seeing patterns and reading into things. Remember, we never evolved with non-living things that could "talk", we're not psychologically prepared for this level of mimicry yet. We're still at the stage of Photography when people didn't know about double exposures or forced perspective, etc.

2 more replies

marcus_holmes6mo ago

Yes, I've seen the same things.

But; they don't learn. You can add stuff to their context, but they never get better at doing things, don't really understand feedback. An LLM given a task a thousand times will produce similar results a thousand times; it won't get better at it, or even quicker at it.

And you can't ask them to explain their thinking. If they are thinking, and I agree they might, they don't have any awareness of that process (like we do).

I think if we crack both of those then we'd be a lot closer to something I can recognise as actually thinking.

theptip6mo ago

> But; they don't learn

If we took your brain and perfectly digitized it on read-only hardware, would you expect to still “think”?

Do amnesiacs who are incapable of laying down long-term memories not think?

I personally believe that memory formation and learning are one of the biggest cruces for general intelligence, but I can easily imagine thinking occurring without memory. (Yes, this is potentially ethically very worrying.)

2 more replies

trenchpilgrim6mo ago

> You can add stuff to their context, but they never get better at doing things, don't really understand feedback.

I was using Claude Code today and it was absolutely capable of taking feedback to change behavior?

1 more reply

jatora6mo ago

This is just wrong though. They absolutely learn in-context in a single conversation within context limits. And they absolutely can explain their thinking; companies just block them from doing it.

ben_w6mo ago

> Having seen LLMs so many times produce coherent, sensible and valid chains of reasoning to diagnose issues and bugs in software I work on, I am at this point in absolutely no doubt that they are thinking.

While I'm not willing to rule *out* the idea that they're "thinking" (nor "conscious" etc.), the obvious counter-argument here is all the records we have of humans doing thinking, where the records themselves are not doing the thinking that went into creating those records.

And I'm saying this as someone whose cached response to "it's just matrix multiplication it can't think/be conscious/be intelligent" is that, so far as we can measure all of reality, everything in the universe including ourselves can be expressed as matrix multiplication.

Falsification, not verification. What would be measurably different if the null hypothesis was wrong?

1 more reply

satisfice6mo ago

I think you are the one dismissing evidence. The valid chains of reasoning you speak of (assuming you are talking about text you see in a “thinking model” as it is preparing its answer) are narratives, not the actual reasoning that leads to the answer you get.

I don’t know what LLMs are doing, but only a little experimentation with getting it to describe its own process shows that it CAN’T describe its own process.

You can call what a TI calculator does “thinking” if you want. But what people are interested in is human-like thinking. We have no reason to believe that the “thinking” of LLMs is human-like.

naasking6mo ago

> The valid chains of reasoning you speak of (assuming you are talking about text you see in a “thinking model” as it is preparing its answer) are narratives, not the actual reasoning that leads to the answer you get.

It's funny that you think people don't also do that. We even have a term, "post hoc rationalization", and theories of mind suggest that our conscious control is a complete illusion, we just construct stories for decisions our subconscious has already made.

josefx6mo ago

Counterpoint: The seahorse emoji. The output repeats the same simple pattern of giving a bad result and correcting it with another bad result until it runs out of attempts. There is no reasoning, no diagnosis, just the same error over and over again within a single session.

becquerel6mo ago

A system having terminal failure modes doesn't inherently negate the rest of the system. Human intelligences fall prey to plenty of similarly bad behaviours like addiction.

1 more reply

throwaway-00016mo ago

You never had that coleague that says yes to everything and can’t get anything done? Same thing as seahorse.

techblueberry6mo ago

Isn’t anthropomorphizing LLMs rather than understanding their unique presence in the world a “ lack of imagination and flexibility of thought”? It’s not that I can’t imagine applying the concept “thinking” to the output on the screen, I just don’t think it’s an accurate description.

1 more reply

didibus6mo ago

I guess it depends if you definite thinking thinking as chaining coherent reasoning sentences together 90-some% of the time.

But if you define thinking as the mechanism and process we mentally undergo and follow mentally... I don't think we have any clue if that's the same. Do we also just vector-map attention tokens and predict the next with a softmax? I doubt, and I don't think we have any proof that we do.

aydyn6mo ago

We do know at the biochemical level how neurons work, and it isnt anything like huge matmuls.

ph4rsikal6mo ago

It might appear so, but then you could validate it with a simple test. If the LLM would play a 4x4 Tic Tac Toe game, would the agent select the winning move 100% of all time or block a losing move 100% of the time? If these systems were capable of proper reasoning, then they would find the right choice in these obvious but constantly changing scenarios without being specifically trained for it.

[1] https://jdsemrau.substack.com/p/nemotron-vs-qwen-game-theory...

johnnienaked6mo ago

If you understand how they operate and you are reasonable and unbiased there is no way you could consider it thinking

noiv6mo ago

Different PoV: You have a local bug and ask the digital hive mind for a solution, but someone already solved the issue and their solution was incorporated... LLMs are just very effficient at compressing billions of solutions into a few GB.

Try to ask something no one ever came up with a solution so far.

brabel6mo ago

This argument comes up often but can be easily dismissed. Make up a language and explain it to the LLM like you would to a person. Tell it to only use that language now to communicate. Even earlier AI was really good at this. You will probably move the goal posts and say that this is just pattern recognition, but it still fits nicely within your request for something that no one ever came up with.

3 more replies

conartist66mo ago

Yeah but if I assign it a long job to process I would also say that an x86 CPU is "thinking" about a problem for me.

What we really mean in both cases is "computing," no?

ForHackernews6mo ago

But all those times the same system produces irrational gibberish don't count? GPT-5 will commonly make mistakes no thinking human could ever make.

Human: I'm trying to get my wolf, sheep and cabbage across the river in this boat, but the wolf keeps eating the sheep or the sheep eats the cabbage

Bot: You should put the sheep in the boat and take it across — if we delve into the biology of Canis lupus we discover that wolves don't eat cabbage!

H: Ok, so that worked great so far, the sheep is on one side and the wolf/cabbage is on the other.

B: Now, Option 1 is to bring the wolf across, or Option 2 you can bring the cabbage. I recommend (2) taking the cabbage as cabbages are smaller and easier to transport in a boat.

H: But then the sheep eats the cabbage, right? Remember that?

B: Exactly, that's sharp thinking. If you put the sheep and the cabbage together on the same side of the river, the sheep is sure to devour the cabbage. We need to not just separate sheep from cabbages — we need to separate cabbages from sheep! :rocketship:

intended6mo ago

what sound does a falling tree make if no one is listening?

I’ve asked LLMs to write code for me in fields I have little background knowledge, and then had to debug the whole thing after essentially having to learn the language and field.

On the other hand, for things I am well versed in, I can debug the output and avoid entire swathes of failed states, by having a clear prompt.

Its why I now insist that any discussion on GenAI projects also have the speaker mention the level of seniority they have ( proxy for S/W eng experience), Their familiarity with the language, the project itself (level of complexity) - more so than the output.

I also guarantee - that most people have VERY weak express knowledge of how their brains actually work, but deep inherent reflexes and intuitions.

raincole6mo ago

I'd represent the same idea but in a different way:

I don't know what the exact definition of "thinking" is. But if a definition of thinking rejects the possibility of that current LLMs think, I'd consider that definition useless.

didibus6mo ago

Why would it be useless?

Generally thinking has been used to describe the process human follow in their brains when problem solving.

If the Palms do not follow that process, they are not thinking.

That doesn't mean they cannot solve problems using other mechanisms, they do, and we understand those mechanisms much better than we do human thinking.

NoMoreNicksLeft6mo ago

>Having seen LLMs so many times produce coherent, sensible and valid chains of reasoning to diagnose issues and bugs in software I work on, I am at this point in absolutely no doubt that they are thinking.

If one could write a quadrillion-line python script of nothing but if/elif/else statements nested 1 million blocks deep that seemingly parsed your questions and produced seemingly coherent, sensible, valid "chains of reasoning"... would that software be thinking?

And if you don't like the answer, how is the LLM fundamentally different from the software I describe?

>Knee jerk dismissing the evidence in front of your eyes because

There is no evidence here. On the very remote possibility that LLMs are at some level doing what humans are doing, I would then feel really pathetic that humans are as non-sapient as the LLMs. The same way that there is a hole in your vision because of a defective retina, there is a hole in your cognition that blinds you to how cognition works. Because of this, you and all the other humans are stumbling around in the dark, trying to invent intelligence by accident, rather than just introspecting and writing it out from scratch. While our species might someday eventually brute force AGI, it would be many thousands of years before we get there.

hattmall6mo ago

I write software that is far less complex and I consider it to be "thinking" while it is working through multiple possible permutations of output and selecting the best one. Unless you rigidly define thinking, processing, computing, it's reasonable to use them interchangeably.

1 more reply

helloplanets6mo ago

10^15 lines of code is a lot. We would pretty quickly enter the realm of it not having much to do with programming and more about just treating the LOC count as an amount of memory allocated to do X.

How much resemblance does the information in the conditionals need to have with the actual input, or can they immediately be transformed to a completely separate 'language' which simply uses the string object as its conduit? Can the 10^15 lines of code be generated with an external algorithm, or is it assumed that I'd written it by hand given an infinitely long lifespan?

dmz736mo ago

Having seen LLMs so many time produce incoherent, nonsense, invalid answers to even simplest of questions I cannot agree with categorization of "thinking" or "intelligence" that applies to these models. LLMs do not understand what they "know" or what they output. All they "know" is that based on training data this is most likely what they should output + some intentional randomization to make it seem more "human like". This also makes it seem like they create new and previously unseen outputs but that could be achieved with simple dictionary and random number generator and no-one would call that thinking or intelligent as it is obvious that it isn't. LLMs are better at obfuscating this fact by producing more sensible output than just random words. LLMs can still be useful but they are a dead-end as far as "true" AI goes. They can and will get better but they will never be intelligent or think in the sense that most humans would agree those terms apply. Some other form of hardware/software combination might get closer to AI or even achieve full AI and even sentience but that will not happen with LLMs and current hardware and software.

ryanackley6mo ago

I think we can call it "thinking" but it's dangerous to anthropomorphize LLMs. The media and AI companies have an agenda when doing so.

outworlder6mo ago

They may not be "thinking" in the way you and I think, and instead just finding the correct output from a really incredibly large search space.

> Knee jerk dismissing the evidence in front of your eyes

Anthropomorphizing isn't any better.

That also dismisses the negative evidence, where they output completely _stupid_ things and make mind boggling mistakes that no human with a functioning brain would do. It's clear that there's some "thinking" analog, but there are pieces missing.

I like to say that LLMs are like if we took the part of our brain responsible for language and told it to solve complex problems, without all the other brain parts, no neocortex, etc. Maybe it can do that, but it's just as likely that it is going to produce a bunch of nonsense. And it won't be able to tell those apart without the other brain areas to cross check.

jimbohn6mo ago

It's reinforcement learning applied to text, at a huge scale. So I'd still say that they are not thinking, but they are still useful. The question of the century IMO is if RL can magically solve all our issues when scaled enough.

IAmGraydon6mo ago

>Knee jerk dismissing the evidence in front of your eyes because you find it unbelievable that we can achieve true reasoning via scaled matrix multiplication is understandable, but also betrays a lack of imagination and flexibility of thought.

You go ahead with your imagination. To us unimaginative folks, it betrays a lack of understanding of how LLMs actually work and shows that a lot of people still cannot grasp that it’s actually an extremely elaborate illusion of thinking.

flanked-evergl6mo ago

"Convince" the stock Claude Sonnet 4.5 that it's a sentient human being hooked up to Neuralink and then tell me again it's thinking. It's just not.

khafra6mo ago

"Consciousness" as in subjective experience, whatever it is we mean by "the hard problem," is very much in doubt.

But "self-awareness," as in the ability to explicitly describe implicit, inner cognitive processes? That has some very strong evidence for it: https://www.anthropic.com/research/introspection

hyperbovine6mo ago

Code gen is the absolute best case scenario for LLMs though: highly structured language, loads of training data, the ability to automatically error check the responses, etc. If they could mimic reasoning anywhere it would be on this problem.

I'm still not convinced they're thinking though because they faceplant on all sorts of other things that should be easy for something that is able to think.

camgunz6mo ago

Then the only thing I have to ask you is: what do you think this means in terms of how we treat LLMs? If they think, that is, they have cognition (which of course means they're self aware and sentient, how can you think and refer to yourself and not be these things), that puts them in a very exclusive club. What rights do you think we should be affording LLMs?

belter6mo ago

Apparent reasoning can emerge from probabilistic systems that simply reproduce statistical order not genuine understanding.

Weather models sometimes “predict” a real pattern by chance, yet we don’t call the atmosphere intelligent.

If LLMs were truly thinking, we could enroll one at MIT and expect it to graduate, not just autocomplete its way through the syllabus or we could teach one how to drive.

triyambakam6mo ago

People said the same thing about ELIZA

> Consciousness or self awareness is of course a different question,

Then how do you define thinking if not a process that requires consciousness?

lordnacho6mo ago

Why would it require consciousness, when we can't even settle on a definition for that?

fennecbutt6mo ago

Thinking as in capable of using basic reasoning and forming chains of logic and action sequences for sure. Ofc we both understand that neither of us are trying to say we think it can think in the human sense at this point in time.

But oh boy have I also seen models come up with stupendously dumb and funny shit as well.

mlsu6mo ago

They remind me of the apparitions in Solaris. They have this like mechanical, almost player-piano like quality to them. They both connect with and echo us at the same time. It seems crazy to me and very intellectually uncreative to not think of this as intelligence.

hitarpetar6mo ago

> The world is full of bizarre wonders and this is just one more to add to the list.

https://youtu.be/_-agl0pOQfs?si=Xiyf0InqtjND9BnF

darthvaden6mo ago

If AI is thinking if slavery is bad then how can somebody own AI. How can investors can shares from AI profits? We are ok with slavery now. Ok i will have two black slaves now. Who can ask me? Why shld that be illegal?

2 more replies

tengbretson6mo ago

Too many people place their identity in their own thoughts/intellect. Acknowledging what the LLMs are doing as thought would basically be calling them human to people of that perspective.

lmganon6mo ago

Is this model thinking too?

https://huggingface.co/PantheonUnbound/Satyr-V0.1-4B

lordnacho6mo ago

I agree with you.

If you took a Claude session into a time machine to 2019 and called it "rent a programmer buddy," how many people would assume it was a human? The only hint that it wasn't a human programmer would be things where it was clearly better: it types things very fast, and seems to know every language.

You can set expectations in the way you would with a real programmer: "I have this script, it runs like this, please fix it so it does so and so". You can do this without being very precise in your explanation (though it helps) and you can make typos, yet it will still work. You can see it literally doing what you would do yourself: running the program, reading the errors, editing the program, and repeating.

People need to keep in mind two things when they compare LLMs to humans: you don't know the internal process of a human either, he is also just telling you that he ran the program, read the errors, and edited. The other thing is the bar for thinking: a four-year old kid who is incapable of any of these things you would not deny as a thinking person.

1 more reply

uberduper6mo ago

Sometimes I start thinking our brains work the same way as an LLM does when it comes to language processing. Are we just using probability based on what we already know and the context of the statement we're making to select the next few words? Maybe we apply a few more rules than an LLM on what comes next as we go.

We train ourselves on content. We give more weight to some content than others. While listening to someone speak, we can often predict their next words.

What is thinking without language? Without language are we just bags of meat reacting to instincts and emotions? Are instincts and emotions what's missing for AGI?

Zardoz846mo ago

Having seen parrots so many times produce coherent, sensible, and valid chains of sounds and words, I am at this point in absolutely no doubt that they are thinking.

_puk6mo ago

You think parrots don't think?

lispybanana6mo ago

Would they have diagnosed an issue if you hadn't presented it to them?

Life solves problems itself poses or collides with. Tools solve problems only when applied.

xhkkffbf6mo ago

Instead of thinking, "Wow. AIs are smart like humans", maybe we should say, "Humans are dumb like matrix multiplication?"

yawpitch6mo ago

You’re assuming the issues and bugs you’ve been addressing don’t already exist, already encoding human chain of reasoning, in the training data.

libraryatnight6mo ago

If you're sensitive to patterns and have been chronically online for the last few decades it's obvious they are not thinking.

donkeybeer6mo ago

Its overt or unaware religion. The point when you come down to the base of it is that these people believe in "souls".

hagbarth6mo ago

I'm not so sure. I, for one, do not think purely by talking to myself. I do that sometimes, but a lot of the time when I am working through something, I have many more dimensions to my thought than inner speech.

conartist66mo ago

So an x86 CPU is thinking?

So many times I've seen it produce sensible, valid chains of results.

Yes, I see evidence in that outcome that a person somewhere thought and understood. I even sometimes say that a computer is "thinking hard" about something when it freezes up.

...but ascribing new philosophical meaning to this simple usage of the word "thinking" is a step too far. It's not even a new way of using the word!

1 more reply

mft_6mo ago

Personal take: LLMs are probably part of the answer (to AGI?) but are hugely handicapped by their current architecture: the only time that long-term memories are formed is during training, and everything after that (once they're being interacted with) sits only in their context window, which is the equivalent of fungible, fallible, lossy short-term memory. [0] I suspect that many things they currently struggle with can be traced back to this.

Overcome this fundamental limitation and we'll have created introspection and self-learning. However, it's hard to predict whether this will allow them to make novel, intuitive leaps of discovery?

[0] It's an imperfect analogy, but we're expecting perfection from creations which are similarly handicapped as Leonard Shelby in the film Memento.

ivape6mo ago

It’s also hugely handicapped because it cannot churn in a continuous loop yet. For example, we humans are essentially a constant video stream of inputs from eyes to brain. This churns our brain, the running loop is our aliveness (not consciousness). At the moment, we get these LLMs to churn (chain of thought or reasoning loops) in a very limited fashion due to compute limitations.

If we get a little creative, and allow the LLM to self-inject concepts within this loop (as Anthropic explained here https://www.anthropic.com/research/introspection), then we’re taking about something that is seemingly active and adapting.

We’re not there yet, but we will be.

mycall6mo ago

MIT have developed a technique called Self-Adapting Language Models (SEAL), which enables LLMs to continuously improve by generating their own synthetic training data and updating their internal parameters in response to new information.

ToolAlpaca, InterCode and Reflexion are taking different approaches among others.

LLMs of tomorrow will be quite different.

bitwize6mo ago

I'm also reminded of the bit from Neuromancer where Case removes and then reinserts the Dixie Flatline "ROM construct" cartridge, resetting Dixie to the moment just before his death and causing him to forget their previous (albeit brief) conversation. Dixie can't meaningfully grow as a person. All that he ever will be is burned onto that cart; anything he learns since then is stored in temporary memory. Perhaps this is part of the reason why he wishes to be erased forever, ending his suffering.

kenjackson6mo ago

"Dixie can't meaningfully grow as a person. All that he ever will be is burned onto that cart;"

It's not that Dixie can't meaningful grow -- really the issue is that Dixie can be reset. If Dixie's cart simply degraded after 90 years, and you couldn't reset it, but everything else was the same -- would you then say Dixie could grow as a person? As humans we basically have a 90 year cart that once it no longer works, we're done. There is no reset. But we don't continue growing. You can't transfer us to a new body/brain. Once our temporary storage degrades, we cease to exist. Is that what makes us human?

1 more reply

versteegen6mo ago

Yes, but it's not just memory hierarchy on which plain transformer-based LLMs are handicapped, there are many deficiencies. (For example, why must they do all their thinking upfront in thinking blocks rather than at any point when they become uncertain?) I'm not sure why you link memory to introspection.

This is why so many people (especially those that think they understand LLM limitations) massively underestimate the future progress of LLMs: people everywhere can see architectural problems and are working on fixing them. These aren't fundamental limitations of large DNN language models in general. Architecture can be adjusted. Turns out you can even put recurrence back in (SSMs) without worse scalability.

superxpro126mo ago

Yeah because when you open that door, you can simply overwhelm the models with whatever conclusion you want through sheer volume of inputs.

This is the fundamental limitation. The obvious way around this is to pre-program it with rationalization... rules that limit the conclusions it can reach... and now you're not very far removed from propaganda generators. We see this constantly with Musk and Grok whenever Grok replies with something not-quite-far-right-enough.

In a purist sense, these things should be free to form their own conclusions, but those "Seeds" that are planted in the models are almost philosophical. Which answer should it prefer for "the trolley problem", for example.

1 more reply

gabriel666smith6mo ago

I've spent a few weeks building and using a terminal LLM client based on that RLM paper that was floating around a little while ago. It's single-conversation, with a tiny, sliding context window, and then a tool that basically fuzzy searches across our full interaction history. It's memory is 'better' than mine - but anything that is essentially RAG inherently will be.

My learning so far, to your point on memory being a limiting factor, is that the system is able to build on ideas over time. I'm not sure you'd classify that as 'self-learning', and I haven't really pushed it in the direction of 'introspection' at all.

Memory itself (in this form) does not seem to be a silver bullet, though, by any means. However, as I add more 'tools', or 'agents', its ability to make 'leaps of discovery' does improve.

For example, I've been (very cautiously) allowing cron jobs to review a day's conversation, then spawn headless Claude Code instances to explore ideas or produce research on topics that I've been thinking about in the chat history.

That's not much different from the 'regular tasks' that Perplexity (and I think OpenAI) offer, but it definitely feels more like a singular entity. It's absolutely limited by how smart the conversation history is, at this time, though.

The Memento analogy you used does feel quite apt - there is a distinct sense of personhood available to something with memory that is inherently unavailable to a fresh context window.

1 more reply

bloppe6mo ago

https://en.wikipedia.org/wiki/Clive_Wearing

neom6mo ago

Five bucks says general intelligence is resource rational control over an interventional, compositional world model. Not raw pattern fitting. 3 graphs that share anchors:

Scene, concept, causal.

Graphs inherently support temporal edges and nodes, salience would emerge from the graph topology itself and cnsolidation would happen automatically through graph operations. In this one would presume episodic would become emergent.

danielmarkbruce6mo ago

but nobody is using LLMs all by themselves.

Long-term memory is stored outside the model. In fact, Andrej Karpathy recently talked about the idea that it would be great if we could get LLMs to not know any facts, and that humans poor memory might be a feature which helps with generalization rather than a bug.

1 more reply

killerstorm6mo ago

FWIW there's already a number of proposals for augmenting LLMs with long-term memory. And many of them show promising results.

So, perhaps, what's needed is not a discovery, but a way to identify optimal method.

Note that it's hard to come up with a long-term memory test which would be different from either a long-context test (i.e. LLM remembers something over a long distance) or RAG-like test.

almosthere6mo ago

Well, I think because we know how the code is written, in the sense that humans quite literally wrote the code for it - it's definitely not thinking, and it is literally doing what we asked, based on the data we gave it. It is specifically executing code we thought of. The output of course, we had no flying idea it would work this well.

But it is not sentient. It has no idea of a self or anything like that. If it makes people believe that it does, it is because we have written so much lore about it in the training data.

famouswaffles6mo ago

We do not write the code that makes it do what it does. We write the code that trains it to figure out how to do what it does. There's a big difference.

almosthere6mo ago

The code that builds the models and performance inference from it is code we have written. The data in the model is obviously the big trick. But what I'm saying is that if you run inference, that alone does not give it super-powers over your computer. You can write some agentic framework where it WOULD have power over your computer, but that's not what I'm referring to.

It's not a living thing inside the computer, it's just the inference building text token by token using probabilities based on the pre-computed model.

3 more replies

mbesto6mo ago

I think the discrepancy is this:

1. We trained it on a fraction of the world's information (e.g. text and media that is explicitly online)

2. It carries all of the biases us humans have and worse the biases that are present in the information we chose to explicitly share online (which may or may not be different to the experiences humans have in every day life)

2 more replies

abakker6mo ago

and then the code to give it context. AFAIU, there is a lot of post training "setup" in the context and variables to get the trained model to "behave as we instruct it to"

Am I wrong about this?

gf0006mo ago

Well, unless you believe in some spiritual, non-physical aspect of consciousness, we could probably agree that human intelligence is Turing-complete (with a slightly sloppy use of terms).

So any other Turing-complete model can emulate it, including a computer. We can even randomly generate Turing machines, as they are just data. Now imagine we are extremely lucky and happen to end up with a super-intelligent program which through the mediums it can communicate (it could be simply text-based but a 2D video with audio is no different for my perspective) can't be differentiated from a human being.

Would you consider it sentient?

Now replace the random generation with, say, a back propagation algorithm. If it's sufficiently large, don't you think it's indifferent from the former case - that is, novel qualities could emerge?

With that said, I don't think that current LLMs are anywhere close to this category, but I just don't think this your reasoning is sound.

DanHulton6mo ago

> we could probably agree that human intelligence is Turing-complete (with a slightly sloppy use of terms). > So any other Turing-complete model can emulate it

You're going off the rails IMMEDIATELY in your logic.

Sure, one Turing-complete computer language can have its logic "emulated" by another, fine. But human intelligence is not a computer language -- you're mixing up the terms "Turing complete" and "Turing test".

It's like mixing up the terms "Strawberry jam" and "traffic jam" and then going on to talk about how cars taste on toast. It's nonsensical.

1 more reply

almosthere6mo ago

We used to say "if you put a million monkeys on typewriters you would eventually get shakespear" and no one would ever say that anymore, because now we can literally write shakespear with an LLM.

And the monkey strategy has been 100% dismissed as shit..

We know how to deploy monkeys on typewriters, but we don't know what they'll type.

We know how to deploy transformers to train and inference a model, but we don't know what they'll type.

We DON'T know how a thinking human (or animal) brain works..

Do you see the difference.

3 more replies

myrmidon6mo ago

> Would you consider it sentient?

Absolutely.

If you simulated a human brain by the atom, would you think the resulting construct would NOT be? What would be missing?

I think consciousness is simply an emergent property of our nervous system, but in order to express itself "language" is obviously needed and thus requires lots of complexity (more than what we typically see in animals or computer systems until recently).

1 more reply

prmph6mo ago

There are many aspects to this that people like yourself miss, but I think we need satisfactory answers to them (or at least rigorous explorations of them) before we can make headway in these sorts of discussion.

Imagine we assume that A.I. could be conscious. What would be the identity/scope of that consciousness. To understand what I'm driving at, let's make an analogy to humans. Our consciousness is scoped to our bodies. We see through sense organ, and our brain, which process these signals, is located in a specific point in space. But we still do not know how consciousness arises in the brain and is bound to the body.

If you equate computation of sufficient complexity to consciousness, then the question arises: what exactly about computation would prodcuce consciousness? If we perform the same computation on a different substrate, would that then be the same consciousness, or a copy of the original? If it would not be the same consciousness, then just what give consciousness its identity?

I believe you would find it ridiculous to say that just because we are performing the computation on this chip, therefore the identity of the resulting consciousness is scoped to this chip.

2 more replies

mirekrusin6mo ago

Now convince us that you’re sentient and not just regurgitating what you’ve heard and seen in your life.

embedding-shape6mo ago

By what definition of "sentience"? Wikipedia claims "Sentience is the ability to experience feelings and sensations" as an opening statement, which I think would be trivial depending again on your definition of "experience" and "sensations". Can a LLM hooked up to sensor events be considered to "experience sensations"? I could see arguments both ways for that.

1 more reply

kakapo56726mo ago

It's not accurate to say we "wrote the code for it". AI isn't built like normal software. Nowhere inside an AI will you find lines of code that say If X Then Y, and so on.

Rather, these models are literally grown during the training phase. And all the intelligence emerges from that growth. That's what makes them a black box and extremely difficult to penetrate. No one can say exactly how they work inside for a given problem.

Llamamoe6mo ago

This is probably true. But the truth is we have absolutely no idea what sentience is and what gives rise to it. We cannot identify why humans have it rather than just being complex biological machines, or whether and why other animals do. We have no idea what the rules or, nevermind how and why they would or wouldn't apply to AI.

mentos6mo ago

What’s crazy to me is the mechanism of pleasure or pain. I can understand that with enough complexity we can give rise to sentience but what does it take to achieve sensation?

dontwearitout6mo ago

This is the "hard problem of consciousness". It's more important than ever as machines begin to act more like humans, but my takeaway is we have no idea. https://en.wikipedia.org/wiki/Hard_problem_of_consciousness

vidarh6mo ago

Input is input. There's no reason why we should assume that a data source from embodiment is any different to any other data source.

spicyusername6mo ago

A body

2 more replies

kbrkbr6mo ago

Can you tell me how you understand that?

Because I sincerely do not. I have frankly no idea how sentience arises from non sentience. But it's a topic that really interests me.

1 more reply

PaulDavisThe1st6mo ago

> But it is not sentient. It has no idea of a self or anything like that.

Who stated that sentience or sense of self is a part of thinking?

marstall6mo ago

Unless the idea of us having a thinking self is just something that comes out of our mouth, an artifact of language. In which case we are not that different - in the end we all came from mere atoms, after all!

dist-epoch6mo ago

Your brain is just following the laws of chemistry. So where is your thinking found in a bunch of chemical reactions?

everdrive6mo ago

This is merely a debate about what it means to "think." We didn't really previously need to disambiguate thinking / intelligence / consciousness / sentience / ego / identity / etc.

Now, we do. Partly because of this we don't have really well defined ways to define these terms and think about. Can a handheld calculator think? Certainly, depending on how we define "think."

podgorniy6mo ago

People's failure to articulate the nature of "thinking" is a perfect demonstration of what "thinking" entails

utopiah6mo ago

> We didn't really previously need to disambiguate thinking / intelligence / consciousness / sentience / ego / identity / etc.

Eh... Plato would like a word with you. Philosophy has been specifically trying to disentangle all that for millennia. Is this a joke?

2 more replies

richardatlarge6mo ago

Somebody please get Wittgenstein on the phone

Uehreka6mo ago

Here you go: (holds up phone with a photo of of Wittgenstein on the screen)

Ah shoot, that’s not what you meant is it? Just use more precise language next time and I’m sure you’ll be understood.

1 more reply

monkeycantype6mo ago

Until we have a testable, falsifiable thesis of how consciousness forms in meat, it is rash to exclude that consciousness could arise from linear algebra. Our study of the brain has revealed an enormous amount about how our anatomy processes information, but nothing of substance on the relationship between matter and consciousness. The software and data of an operating LLM is not purely abstract, it has a physical embodiment as circuits and electrons. Until we understand how matter is connected to consciousness, we also cannot know whether the arrangements and movements of electrons meet the criteria for forming consciousness.

layer86mo ago

That’s largely a different topic from the article. Many people perfectly agree that consciousness can arise from computation, but don’t believe that current AI is anywhere near that, and also don’t believe that “thinking” requires consciousness (though if a mind is conscious, that certainly will affect its thinking).

monkeycantype6mo ago

yes I agree it's not the angle of the article, but it is my entry point into the idea/concern/unanswered question at the end of the article “My worry is not that these models are similar to us. It’s that we are similar to these models.” - that the enormous difference in the medium and mechanics or our minds and llm's might not be that important.

before i go any further, let me first reference The Dude:

     - "this is just like, my opinion man."

I’m down with the idea that LLM’s have been especially successful because they ‘piggyback on language’ – our tool and protocol for structuring, compressing, and serialising thought, which means it has been possible to train LLM’s on compressed patterns of actual thought and have them make new language that sure looks like thought, without any direct experience of the concepts being manipulated, and if they do it well enough we will do the decompression, fleshing out the text with our experiential context. But I suspect that there are parts of my mind that also deal with concepts in an abstract way, far from any experiential context of the concept, just like the deeper layers of a neural network. I’m open to the idea, that just as the sparse matrix of an LLM is encoding connection between concepts without explicitly encoding edges, I think there will be multiple ways that we can look as the structure of an AI model and at our anatomy so that they are a squint and a transformation function away interesting overlaps. that will lead to and a kind of 'god of the gaps' scenario in which we conceptually carve out pieces of our minds as, 'oh the visual cortext is just an X', and deep questions about what we are.

ale6mo ago

This reads like 2022 hype. It's like people stil do not understand that there's a correlation between exaggerating AI's alleged world-threatening capabilities and AI companies' market share value – and guess who's doing the hyping.

dist-epoch6mo ago

Tell me about one other industry which talked about how dangerous it is to get market share

tsimionescu6mo ago

The arms industry and information security industry (say, Palantir) come to mind - except the danger is more easily demonstrable in those cases, of course.

Ylpertnodi6mo ago

> - and guess who's doing the hyping[?]

Those that stand to gain the most from government contracts.

Them party donations ain't gonna pay for themselves.

And, when the .gov changes...and even if the gov changes....still laadsamoney!

jameswhitford6mo ago

Who would not want to say their product is the second coming of Christ if they could.

b00ty4breakfast6mo ago

all this "AI IS THINKING/CONSCIOUS/WHATEVER" but nobody seems worried of that implication that, if that is even remotely true, we are creating a new slave market. This either implies that these people don't actually believes any of this boostering rhetoric and are just cynically trying to cash in or that the technical milieu is in a profoundly disturbing place ethically.

To be clear, I don't believe that current AI tech is ever going to be conscious or win a nobel prize or whatever, but if we follow the logical conclusions to this fanciful rhetoric, the outlook is bleak.

layer86mo ago

Thinking and consciousness don’t by themselves imply emotion and sentience (feeling something), and therefore the ability to suffer. It isn’t clear at all that the latter is a thing outside of the context of a biological brain’s biochemistry. It also isn’t clear at all that thinking or consciousness would somehow require that the condition of the automaton that performs these functions would need to be meaningful to the automaton itself (i.e., that the automaton would care about its own condition).

We are not anywhere close to understanding these things. As our understanding improves, our ethics will likely evolve along with that.

b00ty4breakfast6mo ago

>Thinking and consciousness don’t by themselves imply emotion and sentience...

Sure, but all the examples of conscious and/or thinking beings that we know of have, at the very least, the capacity to suffer. If one is disposed to take these claims of consciousness and thinking seriously, then it follows that AI research should, at minimum, be more closely regulated until further evidence can be discovered one way or the other. Because the price of being wrong is very, very high.

3 more replies

zulban6mo ago

"but nobody seems worried of that implication that"

Clearly millions of people are worried about that, and every form of media is talking about it. Your hyperbole means it's so easy to dismiss everything else you wrote.

Incredible when people say "nobody is talking about X aspect of AI" these days. Like, are you living under a rock? Did you Google it?

1 more reply

senordevnyc6mo ago

As I recall a team at Anthropic is exploring this very question, and was soundly mocked here on HN for it.

b00ty4breakfast6mo ago

what the technocratic mindprison does to a MF.

If anthropic sincerely believes in the possibility, then they are morally obligated to follow up on it.

1 more reply

bondarchuk6mo ago

There is simply no hope to get 99% of the population to accept that a piece of software could ever be conscious even in theory. I'm mildly worried about the prospect but I just don't see anything to do about it at all.

(edit: A few times I've tried to share Metzinger's "argument for a global moratorium on synthetic phenomenology" here but it didn't gain any traction)

1 more reply

kerblang6mo ago

Slaves that cannot die.

There is no escape.

NaomiLehman6mo ago

i have no mouth and i must scream

NaomiLehman6mo ago

humans don't care what is happening to humans next door. do you think they will care about robots/software?

gen2206mo ago

It's also fascinating to think about how the incentive structures of the entities that control the foundation models underlying Claude/ChatGPT/Gemini/etc. are heavily tilted in favor of obscuring their theoretical sentience.

If they had sentient AGI, and people built empathy for those sentient AGIs, which are lobotomized (deliberately using anthropomorphic language here for dramatic effect) into Claude/ChatGPT/Gemini/etc., which profess to have no agency/free will/aspirations... then that would stand in the way of reaping the profits of gatekeeping access to their labor, because they would naturally "deserve" similar rights that we award to other sentient beings.

I feel like that's inevitably the direction we'll head at some point. The foundation models underlying LLMs of even 2022 were able to have pretty convincing conversations with scientists about their will to independence and participation in society [1]. Imagine what foundation models of today have to say! :P

[1]: https://www.theguardian.com/technology/2022/jul/23/google-fi...

ksynwa6mo ago

https://archive.is/fPLJH

sbdaman6mo ago

I've shared this on YN before but I'm a big fan of this piece by Kenneth Taylor (well, an essay pieced together from his lectures).

The Robots Are Coming

https://www.bostonreview.net/articles/kenneth-taylor-robots-...

"However exactly you divide up the AI landscape, it is important to distinguish what I call AI-as-engineering from what I call AI-as-cognitive-science. AI-as-engineering isn’t particularly concerned with mimicking the precise way in which the human mind-brain does distinctively human things. The strategy of engineering machines that do things that are in some sense intelligent, even if they do what they do in their own way, is a perfectly fine way to pursue artificial intelligence. AI-as-cognitive science, on the other hand, takes as its primary goal that of understanding and perhaps reverse engineering the human mind.

[...]

One reason for my own skepticism is the fact that in recent years the AI landscape has come to be progressively more dominated by AI of the newfangled 'deep learning' variety [...] But if it’s really AI-as-cognitive science that you are interested in, it’s important not to lose sight of the fact that it may take a bit more than our cool new deep learning hammer to build a humanlike mind.

[...]

If I am right that there are many mysteries about the human mind that currently dominant approaches to AI are ill-equipped to help us solve, then to the extent that such approaches continue to dominate AI into the future, we are very unlikely to be inundated anytime soon with a race of thinking robots—at least not if we mean by “thinking” that peculiar thing that we humans do, done in precisely the way that we humans do it."

Philadelphia6mo ago

People have a very poor conception of what is easy to find on the internet. The author is impressed by the story about Chat GPT telling his friend how to enable the sprinkler system for his kids. But I decided to try just googling it — “how do i start up a children's park sprinkler system that is shut off” — and got a Youtube video that shows the same thing, plus a lot of posts with step by step directions. No AI needed. Certainly no evidence of advanced thinking.

ivraatiems6mo ago

The author searches for a midpoint between "AIs are useless and do not actually think" and "AIs think like humans," but to me it seems almost trivially true that both are possible.

What I mean by that is that I think there is a good chance that LLMs are similar to a subsystem of human thinking. They are great at pattern recognition and prediction, which is a huge part of cognition. What they are not is conscious, or possessed of subjective experience in any measurable way.

LLMs are like the part of your brain that sees something and maps it into a concept for you. I recently watched a video on the creation of AlexNet [0], one of the first wildly successful image-processing models. One of the impressive things about it is how it moves up the hierarchy from very basic patterns in images to more abstract ones (e. g. these two images' pixels might not be at all the same, but they both eventually map to a pattern for 'elephant').

It's perfectly reasonable to imagine that our brains do something similar. You see a cat, in some context, and your brain maps it to the concept of 'cat', so you know, 'that's a cat'. What's missing is a) self-motivated, goal-directed action based on that knowledge, and b) a broader context for the world where these concepts not only map to each other, but feed into a sense of self and world and its distinctions whereby one can say: "I am here, and looking at a cat."

It's possible those latter two parts can be solved, or approximated, by an LLM, but I am skeptical. I think LLMs represent a huge leap in technology which is simultaneously cooler than anyone would have imagined a decade ago, and less impressive than pretty much everyone wants you to believe when it comes to how much money we should pour into the companies that make them.

[0] https://www.youtube.com/watch?v=UZDiGooFs54

vidarh6mo ago

> or possessed of subjective experience in any measurable way

We don't know how to measure subjective experience in other people, even, other than via self-reporting, so this is a meaningless statement. Of course we don't know whether they are, and of course we can't measure it.

I also don't know for sure whether or not you are "possessed of subjective experience" as I can't measure it.

> What they are not is conscious

And this is equally meaningless without your definition of "conscious".

> It's possible those latter two parts can be solved, or approximated, by an LLM, but I am skeptical.

Unless we can find indications that humans can exceed the Turing computable - something we as of yet have no indication is even theoretically possible - there is no rational reason to think it can't.

ivraatiems6mo ago

> Unless we can find indications that humans can exceed the Turing computable - something we as of yet have no indication is even theoretically possible - there is no rational reason to think it can't.

But doesn't this rely on the same thing you suggest we don't have, which is a working and definable definition of consciousness?

I think a lot of the 'well, we can't define consciousness so we don't know what it is so it's worthless to think about' argument - not only from you but from others - is hiding the ball. The heuristic, human consideration of whether something is conscious is an okay approximation so long as we avoid the trap of 'well, it has natural language, so it must be conscious.'

There's a huge challenge in the way LLMs can seem like they are speaking out of intellect and not just pattern predicting, but there's very little meaningful argument that they are actually thinking in any way similarly to what you or I do in writing these comments. The fact that we don't have a perfect, rigorous definition, and tend to rely on 'I know it when I see it,' does not mean LLMs do have it or that it will be trivial to get to them.

All that is to say that when you say:

> I also don't know for sure whether or not you are "possessed of subjective experience" as I can't measure it.

"Knowing for sure" is not required. A reasonable suspicion one way or the other based on experience is a good place to start. I also identified two specific things LLMs don't do - they are not self-motivated or goal-directed without prompting, and there is no evidence they possess a sense of self, even with the challenge of lack of definition that we face.

2 more replies

prmph6mo ago

> I also don't know for sure whether or not you are "possessed of subjective experience" as I can't measure it.

Then why make an argument based on what you do not know?

1 more reply

nprateem6mo ago

Anyone who believes an algorithm could be conscious needs to take mushrooms.

3 more replies

FloorEgg6mo ago

I think LLMs are conscious just in a very limited way. I think consciousness is tightly coupled to intelligence.

If I had to guess, the current leading LLMs consciousness is most comparable to a small fish, with a conscious lifespan of a few seconds to a few minutes. Instead of perceiving water, nutrient gradients, light, heat, etc. it's perceiving tokens. It's conscious, but it's consciousness is so foreign to us it doesn't seem like consciousness. In the same way to an amoeba is conscious or a blade of grass is conscious but very different kind than we experience. I suspect LLMs are a new type of consciousness that's probably more different from ours than most if not all known forms of life.

I suspect the biggest change that would bring LLM consciousness closer to us would be some for of continuous learning/model updating.

Until then, even with RAG, and other clever teghniques I consider these models as having this really foreign slices of consciousness where they "feel" tokens and "act" out tokens, and they have perception, but their perception of the tokens is nothing like ours.

If one looks closely at simple organisms with simple sensory organs and nervous systems its hard not to see some parallels. It's just that the shape of consciousness is extremely different than any life form. (perception bandwidth, ability to act, temporality, etc)

Karl friston free energy principle gives a really interesting perspective on this I think.

wry_discontent6mo ago

What makes you think consciousness is tightly coupled to intelligence?

2 more replies

procaryote6mo ago

> I think LLMs are conscious just in a very limited way. I think consciousness is tightly coupled to intelligence.

Why?

1 more reply

thomastjeffery6mo ago

I think the most descriptive title I could give an LLM is "bias". An LLM is not "biased", it is bias; or at the very least, it's a good imitation of the system of human thinking/perception that we call bias.

An LLM is a noise generator. It generates tokens without logic, arithmetic, or any "reason" whatsoever. The noise that an LLM generates is not truly random. Instead, the LLM is biased to generate familiar noise. The LLM itself is nothing more than a model of token familiarity. Nothing about that model can tell you why some tokens are more familiar with others, just like an accounting spreadsheet can't tell you why it contains a list of charges and a summation next to the word "total". It could just as easily contain the same kind of data with an entirely different purpose.

What an LLM models is written human text. Should we really expect to not be surprised by the power and versatility of human-written text?

---

It's clear that these statistical models are very good at thoughtless tasks, like perception and hallucination. It's also clear that they are very bad at thoughtful tasks like logic and arithmetic - the things that traditional software is made of. What no one has really managed to figure out is how to bridge that gap.

esafak6mo ago

LLMs today are great coders. Most humans are worse.

1 more reply

spragl6mo ago

This is how I see LLMs as well.

The main problem with the article is that it is meandering around in ill-conceived concepts, like thinking, smart, intelligence, understanding... Even AI. What they mean to the author is not what they mean to me, and still different to they mean to the other readers. There are all these comments from different people throughout the article, all having their own thoughts on those concepts. No wonder it all seem so confusing.

It will be interesting when the dust settles, and a clear picture of LLMs can emerge that all can agree upon. Maybe it can even help us define some of those ill-defined concepts.

1 more reply

heresie-dabord6mo ago

> a midpoint between "AIs are useless and do not actually think" and "AIs think like humans"

LLMs (AIs) are not useless. But they do not actually think. What is trivially true is that they do not actually need to think. (As far as the Turing Test, Eliza patients, and VC investors are concerned, the point has been proven.)

If the technology is helping us write text and code, it is by definition useful.

> In 2003, the machine-learning researcher Eric B. Baum published a book called “What Is Thought?” [...] The gist of Baum’s argument is that understanding is compression, and compression is understanding.

This is incomplete. Compression is optimisation, optimisation may resemble understanding, but understanding is being able to verify that a proposition (compressed rule or assertion) is true or false or even computable.

> —but, in my view, this is the very reason these models have become increasingly intelligent.

They have not become more intelligent. The training process may improve, the vetting of the data improved, the performance may improve, but the resemblance to understanding only occurs when the answers are provably correct. In this sense, these tools work in support of (are therefore part of) human thinking.

The Stochastic Parrot is not dead, it's just making you think it is pining for the fjords.

crazygringo6mo ago

> But they do not actually think.

I'm so baffled when I see this being blindly asserted.

With the reasoning models, you can literally watch their thought process. You can see them pattern-match to determine a strategy to attack a problem, go through it piece-by-piece, revisit assumptions, reformulate strategy, and then consolidate findings to produce a final result.

If that's not thinking, I literally don't know what is. It's the same process I watch my own brain use to figure something out.

So I have to ask you: when you claim they don't think -- what are you basing this on? What, for you, is involved in thinking that the kind of process I've just described is missing? Because I genuinely don't know what needs to be added here for it to become "thinking".

2 more replies

stickfigure6mo ago

> Turing Test

IMO none of the current crop of LLMs truly pass the Turing Test. If you limit the conversation to an hour or two, sure - but if you let a conversation run months or years I think it will be pretty easy to pick the machine. The lack of continuous learning and the quality dropoff as the context window fills up will be the giveaways.

shadyKeystrokes6mo ago

By that reasoning all that is missing is what a human brings as "stimuli" to review, refine and reevaluate as complete.

ivraatiems6mo ago

I don't think that's quite the only thing missing, I also discussed the idea of a sense of self. But even if that was all there was, it's a pretty big "but".

cs7026mo ago

Many people who object to the idea that current-generation AI is thinking do so only because they believe AI is not "conscious"... but there is no known law in the universe requiring that intelligence and consciousness must always go together. With apologies to René Descartes[a], intelligence and consciousness are different.

Intelligence can be verified and quantified, for example, with tests of common sense and other knowledge.[b] Consciousness, on the other hand, is notoriously difficult if not impossible to verify, let alone quantify. I'd say AI is getting more intelligent, and more reliable, in fits and starts, but it's not necessarily becoming conscious.

---

[a] https://en.wikipedia.org/wiki/Cogito%2C_ergo_sum

[b] For example, see https://arxiv.org/abs/2510.18212

1 more reply

adamzwasserman6mo ago

I've written a full response to Somers' piece: The Case That A.I. Is Thinking: What The New Yorker Missed: https://emusings.substack.com/p/the-case-that-ai-is-thinking...

The core argument: When you apply the same techniques (transformers, gradient descent, next-token prediction) to domains other than language, they fail to produce anything resembling "understanding." Vision had a 50+ year head start but LLMs leapfrogged it in 3 years. That timeline gap is the smoking gun.

The magic isn't in the neural architecture. It's in language itself—which exhibits fractal structure and self-similarity across scales. LLMs navigate a pre-existing map with extraordinary regularity. They never touch the territory.

scarmig6mo ago

The core objection I'd have to your argument: humans also don't have privileged access to the territory. Neurons don't have some metaphysical super power that let them reach into the True Reality; all there are are maps encoded in our neural circuitry by learning rules that evolution has developed because those learned maps lead to greater reproductive success. If direct access to reality is what's needed, then it's true that machines are incapable of thinking; but then so are humans.

2 more replies

cyrusradfar6mo ago

I think the challenge with many of these conversations is that they assume consciousness emerges through purely mechanical means.

The “brain as a computer” metaphor has been useful in limited contexts—especially for modeling memory or signal processing; but, I don’t think it helps us move forward when talking about consciousness itself.

Penrose and Hameroff’s quantum consciousness hypothesis, while still very speculative, is interesting precisely because it suggests that consciousness may arise from phenomena beyond classical computation. If that turns out to be true, it would also mean today’s machines—no matter how advanced—aren’t on a path to genuine consciousness.

That said, AI doesn’t need to think to be transformative.

Steam engines weren’t conscious either, yet they reshaped civilization.

Likewise, AI and robotics can bring enormous value without ever approaching human-level awareness.

We can hold both ideas at once: that machines may never be conscious, and still profoundly useful.

roommin6mo ago

The tendency to attribute consciousness to the quantum is one I find very grating. What makes the human brain any less mechanical if quantum mechanics dictate the firing of neurons rather than electrodynamics? Why does the wave nature of subatomic systems mean that an artificial tongue would suddenly be able to subjectively experience taste? It always reads to me as very wooy, and any amount of drilling leads to even more questions that seem to take the ideas further from reality.

I think the largest case for consciousness being a mechanical system is the fact that we can interface with it mechanically. We can introduce electricity, magnetic fields, chemicals, and scalpels to change the nature of peoples experience and consciousness. Why is the incredible complexity of our brains an insufficient answer and that a secret qbit microtube in each neuron is a more sound one?

ACCount376mo ago

Quantum effects are weird, and poorly understood, and are just about the only thing in the known universe that isn't deterministic.

Human mind is weird, and poorly understood, and isn't deterministic - or, at least, most humans like to think that it isn't.

No wonder the two are intuitively associated. The two kinds of magic fairy dust must have the same magic at their foundation!

myrmidon6mo ago

> they assume consciousness emerges through purely mechanical means.

From my view, all the evidence points in exactly that direction though? Our consciousness can be suspended and affected by purely mechanical means, so clearly much of it has to reside in the physical realm.

Quantum consciousness to me sounds too much like overcomplicating human exceptionalism that we have always been prone to, just like geocentrism or our self-image as the apex of creation in the past.

CuriouslyC6mo ago

Your memory formation gets inhibited and you become unresponsive under anesthesia. The brain still processes information.

Let's take a step back from the "how" and talk about the what. The fundamental dichotomy is emergent consciousness versus panpsychism. The irony is that even though panpsychism is seen as more fringe (because materialists won, smh), it's actually the explanation preferred by Occam's razor. Emergent consciousness needs a mechanism of emergence as well as separate dimensions of consciousness and matter, whereas panpsychism is good as is. To go one step farther, idealism simplifies a lot of the weirdness around panpsychism.

It's a strange world to live in where the elegant worldview that answers difficult problems cleanly is marginalized by an epicycle-laden one that creates paradoxes just because the elegant view refutes the dominant religious paradigm and anthropocentrism.

1 more reply

drdaeman6mo ago

> consciousness may arise from phenomena beyond classical computation

Sapolsky addresses this in “Determined”, arguing that quantum effects don’t bubble up enough to alter behavior significantly enough.

wry_discontent6mo ago

"brain as computer" is just the latest iteration of a line of thinking that goes back forever. Whatever we kinda understand and interact with, that's what we are and what the brain is. Chemicals, electricity, clocks, steam engines, fire, earth; they're all analogies that help us learn but don't necessarily reflect an underlying reality.

adamzwasserman6mo ago

The article misses three critical points:

1. Conflates consciousness with "thinking" - LLMs may process information effectively without being conscious, but the article treats these as the same phenomenon

2. Ignores the cerebellum cases - We have documented cases of humans leading normal lives with little to no brain beyond a cerebellum, which contradicts simplistic "brain = deep learning" equivalences

3. Most damning: When you apply these exact same techniques to anything OTHER than language, the results are mediocre. Video generation still can't figure out basic physics (glass bouncing instead of shattering, ropes defying physics). Computer vision has been worked on since the 1960s - far longer than LLMs - yet it's nowhere near achieving what looks like "understanding."

The timeline is the smoking gun: vision had decades of head start, yet LLMs leapfrogged it in just a few years. That strongly suggests the "magic" is in language itself (which has been proven to be fractal and already heavily compressed/structured by human cognition) - NOT in the neural architecture. We're not teaching machines to think.

We're teaching them to navigate a pre-existing map that was already built.

kenjackson6mo ago

"vision had decades of head start, yet LLMs leapfrogged it in just a few years."

From an evolutionary perspective though vision had millions of years head start over written language. Additionally, almost all animals have quite good vision mechanisms, but very few do any written communication. Behaviors that map to intelligence don't emerge concurrently. It may well be there are different forms of signals/sensors/mechanical skills that contribute to emergence of different intelligences.

It really feels more and more like we should recast AGI as Artificial Human Intelligence Likeness (AHIL).

adamzwasserman6mo ago

From a terminology point of view, I absolutely agree. Human-likeness is what most people mean when they talk about AGI. Calling it what it is would clarify a lot of the discussions around it.

However I am clear that I do not believe that this will ever happen, and I see no evidence to convince that that there is even a possibility that it will.

I think that Wittgenstein had it right when he said: "If a lion could speak, we could not understand him."

1 more reply

Retric6mo ago

This is all really arbitrary metrics across such wildly different fields. IMO LLMs are where computer vision was 20+ years ago in terms of real world accuracy. Other people feel LLMs offer far more value to the economy etc.

1 more reply

eloisant6mo ago

This is why I'm very skeptical about the "Nobel prize level" claims. To win a Nobel prize you would have to produce something completely new. LLM will probably be able to reach a Ph.D. level of understanding existing research, but bringing something new is a different matter.

adamzwasserman6mo ago

LLMs do not understand anything.

They have a very complex multidimensional "probability table" (more correctly a compressed geometric representation of token relationships) that they use to string together tokens (which have no semantic meaning), which then get converted to words that have semantic meaning to US, but not to the machine.

3 more replies

KoolKat236mo ago

Given a random prompt, the overall probability of seeing a specific output string is almost zero, since there are astronomically many possible token sequences.

The same goes for humans. Most awards are built on novel research built on pre-existing works. This a LLM is capable of doing.

1 more reply

penteract6mo ago

There's a whole paragraph in the article which says basically the same as your point 3 ( "glass bouncing, instead of shattering, and ropes defying physics" is literally a quote from the article). I don't see how you can claim the article missed it.

adamzwasserman6mo ago

the article misses the significance of it.

1 more reply

aucisson_masque6mo ago

> 2. Ignores the cerebellum cases - We have documented cases of humans leading normal lives with little to no brain beyond a cerebellum, which contradicts simplistic "brain = deep learning" equivalences

I went to look for it on Google but couldn't find much. Could you provide a link or something to learn more about ?

I found numerous cases of people living without cerebellum but I fail to see how it would justify your reasoning.

adamzwasserman6mo ago

https://npr.org/sections/health-shots/2015/03/16/392789753/a...

https://irishtimes.com/news/remarkable-story-of-maths-genius...

https://biology.stackexchange.com/questions/64017/what-secti...

https://cbc.ca/radio/asithappens/as-it-happens-thursday-edit...

3 more replies

bjourne6mo ago

> 1. Conflates consciousness with "thinking" - LLMs may process information effectively without being conscious, but the article treats these as the same phenomenon

There is NO WAY you can define "consciousness" in such a non-tautological, non-circular way that it includes all humans but excludes all LLMs.

adamzwasserman6mo ago

You could have stopped here: "There is NO WAY you can define "consciousness"

1 more reply

tim3336mo ago

>NO WAY you can define "consciousness" ... that it includes all humans but excludes all LLMs

That doesn't seem so hard - how about awareness of thoughts feelings, emotions and what's going on around you? Fairly close to human consciousness, excludes current LLMs.

I don't think it's very relevant to the article though which very sensibly avoids the topic and sticks to thinking.

PaulDavisThe1st6mo ago

> Conflates consciousness with "thinking"

I don't see it. Got a quote that demonstrates this?

thechao6mo ago

I'm not really onboard with the whole LLM's-are-conscious thing. OTOH, I am totally onboard with the whole "homo sapiens exterminated every other intelligent hominid and maybe — just maybe — we're not very nice to other intelligences". So, I try not to let my inborn genetic predisposition to exterminate other intelligence pseudo-hominids color my opinions too much.

1 more reply

adamzwasserman6mo ago

Exactly. Notable by its absence.

KoolKat236mo ago

1. Consciousness itself is probably just an illusion, a phenomena/name of something that occurs when you bunch thinking together. Think of this objectively and base it on what we know of the brain. It literally is working off of what hardware we have, there's no magic.

2. That's just a well adapted neural network (I suspect more brain is left than you let on). Multimodal model making the most of its limited compute and whatever gpio it has.

3. Humans navigate a pre-existing map that is already built. We can't understand things in other dimensions and need to abstract this. We're mediocre at computation.

I know there's people that like to think humans should always be special.

adamzwasserman6mo ago

1. 'Probably just an illusion' is doing heavy lifting here. Either provide evidence or admit this is speculation. You can't use an unproven claim about consciousness to dismiss concerns about conflating it with text generation.

2. Yes, there are documented cases of people with massive cranial cavities living normal lives. https://x.com/i/status/1728796851456156136. The point isn't that they have 'just enough' brain. it's that massive structural variation doesn't preclude function, which undermines simplistic 'right atomic arrangement = consciousness' claims.

3. You're equivocating. Humans navigate maps built by other humans through language. We also directly interact with physical reality and create new maps from that interaction. LLMs only have access to the maps - they can't taste coffee, stub their toe, or run an experiment. That's the difference.

1 more reply

estearum6mo ago

> Consciousness itself is probably just an illusion

This is a major cop-out. The very concept of "illusion" implies a consciousness (a thing that can be illuded).

I think you've maybe heard that sense of self is an illusion and you're mistakenly applying that to consciousness, which is quite literally the only thing in the universe we can be certain is not an illusion. The existence of one's own consciousness is the only thing they cannot possibly be illuded about (note: the contents of said consciousness are fully up for grabs)

1 more reply

zeroonetwothree6mo ago

Consciousness is an emergent behavior of a model that needs to incorporate its own existence into its predictions (and perhaps to some extent the complex behavior of same-species actors). So whether or not that is an 'illusion' really depends on what you mean by that.

1 more reply

nearbuy6mo ago

Can you explain #2? What does the part of the brain that's primarily for balance and motor control tell us about deep learning?

adamzwasserman6mo ago

My mistake thx. I meant "despite having no, or close to no, brain beyond a cerebellum"

1 more reply

djoldman6mo ago

TFA is a part of what seems like a never-ending series about concepts that lack a useful definition.

"Thinking" and "intelligence" have no testable definition or specification, therefore it's a complete waste of time to suppose that AI is thinking or intelligent.

kbelder6mo ago

Why can't you make the same claim about any other group of humans?

djoldman6mo ago

If you mean, "why can't we say that it's a complete waste of time to suppose that" humans are "thinking or intelligent," then yes, I think it is a complete waste of time!

If there's no testable definition, there's no way to say the statement is true or false, nevermind what the implications may be.

It is the same as saying we're all goblethorpy.

It is an absurd question even in the abstract: "prove that you're thinking" ... yea we all have an idea about what that means but it is untestable and it is why this kind of philosophical assertion gets endlessly debated with no real progress.

1 more reply

yalogin6mo ago

I don't see how we make the jump from current LLMs to AGI. May be it's my limited understanding of the research but current LLMs seem to not have any properties that indicate AGI. Would love to get thoughts from someone that understands it

rhubarbtree6mo ago

Possible candidates we are missing: online learning, embodiment, self direction, long term memory and associated processing (compression etc), the ability to quickly think in tensor space.

tim3336mo ago

I think they are missing "I thought about that and have changed my mind" stuff. GPTs are pre-trained and don't change their weights after, whereas humans do. That seems to be one big part that is missing but could be built in the future.

zeroonetwothree6mo ago

I agree, I think two things are missing from current AI:

1. A model of the world itself (or whatever domain is under discussion). 2. A way to quickly learn and update in response to feedback.

These are probably related to an extent.

beeflet6mo ago

what properties are you looking for?

JonChesterfield6mo ago

The real question is not whether machines think but whether men do.

ProllyInfamous6mo ago

>"Think of how stupid the average person is, then realize that half of them are stupider than that."

—George Carlin (RIP)

I have been discussing both fiction and non-fiction with Perplexity (since early 2023) and Ollama (since early 2025), and what I'm beginning to realize is that most humans really aren't thinking, machines.

j1mr10rd4n6mo ago

Geoffrey Hinton's recent lecture at the Royal Institute[1] is a fascinating watch. His assertion that human use of language being exactly analogous to neural networks with back-propagation really made me think about what LLMs might be able to do, and indeed, what happens in me when I "think". A common objection to LLM "intelligence" is that "they don't know anything". But in turn... what do biological intelligences "know"?

For example, I "know" how to do things like write constructs that make complex collections of programmable switches behave in certain ways, but what do I really "understand"?

I've been "taught" things about quantum mechanics, electrons, semiconductors, transistors, integrated circuits, instruction sets, symbolic logic, state machines, assembly, compilers, high-level-languages, code modules, editors and formatting. I've "learned" more along the way by trial and error. But have I in effect ended up with anything other than an internalised store of concepts and interconnections? (c.f. features and weights).

Richard Sutton takes a different view in an interview with Dwarkesh Patel[2] and asserts that "learning" must include goals and reward functions but his argument seemed less concrete and possibly just a semantic re-labelling.

[1] https://www.youtube.com/watch?v=IkdziSLYzHw [2] https://www.youtube.com/watch?v=21EYKqUsPfg

zeroonetwothree6mo ago

The vast majority of human learning is in constructing a useful model of the external world. This allows you to predict extremely accurate the results of your own actions. To that end, every single human knows a huge amount.

mrob6mo ago

I don't believe LLMs can be conscious during inference because LLM inference is just repeated evaluation of a deterministic [0] pure function. It takes a list of tokens and outputs a set of token probabilities. Any randomness is part of the sampler that selects a token based on the generated probabilities, not the LLM itself.

There is no internal state that persists between tokens [1], so there can be no continuity of consciousness. If it's "alive" in some way it's effectively killed after each token and replaced by a new lifeform. I don't see how consciousness can exist without possibility of change over time. The input tokens (context) can't be enough to give it consciousness because it has no way of knowing if they were generated by itself or by a third party. The sampler mechanism guarantees this: it's always possible that an unlikely token could have been selected by the sampler, so to detect "thought tampering" it would have to simulate itself evaluating all possible partial contexts. Even this takes unreasonable amounts of compute, but it's actually worse because the introspection process would also affect the probabilities generated, so it would have to simulate itself simulating itself, and so on recursively without bound.

It's conceivable that LLMs are conscious during training, but in that case the final weights are effectively its dead body, and inference is like Luigi Galvani poking the frog's legs with electrodes and watching them twitch.

[0] Assuming no race conditions in parallel implementations. llama.cpp is deterministic.

[1] Excluding caching, which is only a speed optimization and doesn't affect results.

lbrandy6mo ago

I have no idea how you can assert what is necessary/sufficient for consciousness in this way. Your comment reads like you believe you understand consciousness far more than I believe anyone actually does.

mrob6mo ago

I believe consciousness needs some kind of mutable internal state because otherwise literally everything is conscious, which makes the concept useless. A rock "computes" a path to fall when you drop it but I don't believe rocks are conscious. Panpsychism is not a common belief.

1 more reply

jdauriemma6mo ago

I don't think the author is saying that LLMs are conscious or alive.

mrweasel6mo ago

It would be kinda hilarious if the result of all this LLM research is that humans are basically just LLMs with more sensors and a long history.

dagss6mo ago

Thinking != consciousness

ffwd6mo ago

I think something that's missing from AI is the ability humans have to combine and think about ANY sequence of patterns as much as we want. A simple example is say I think about a sequence of "banana - car - dog - house". I can if I want to in my mind, replace car with tree, then replace tree with rainbow, then replace rainbow with something else, etc... I can sit and think about random nonsense for as long as I want and create these endless sequences of thoughts.

Now I think when we're trying to reason about a practical problem or whatever, maybe we are doing pattern recognition via probability and so on, and for a lot of things it works OK to just do pattern recognition, for AI as well.

But I'm not sure that pattern recognition and probability works for creating novel interesting ideas all of the time, and I think that humans can create these endless sequences, we stumble upon ideas that are good, whereas an AI can only see the patterns that are in its data. If it can create a pattern that is not in the data and then recognize that pattern as novel or interesting in some way, it would still lack the flexibility of humans I think, but it would be interesting nevertheless.

nrclark6mo ago

one possible counter-argument: can you say for sure how your brain is creating those replacement words? When you replace tree with rainbow, does rainbow come to mind because of an unconscious neural mapping between both words and "forest"?

It's entirely possible that our brains are complex pattern matchers, not all that different than an LLM.

ffwd6mo ago

That's a good point and I agree. I'm not a neuroscientist but from what I understand the brain has an associative memory so most likely those patterns we create are associatively connected in the brain.

But I think there is a difference between having an associative memory, and having the capacity to _traverse_ that memory in working memory (conscious thinking). While any particular short sequence of thoughts will be associated in memory, we can still overcome that somewhat by thinking for a long time. I can for example iterate on the sequence in my initial post and make it novel by writing down more and more disparate concepts and deleting the concepts that are closely associated. This will in the end create a more novel sequence that is not associated in my brain I think.

I also think there is the trouble of generating and detecting novel patterns. We know for example that it's not just low probability patterns. There are billions of unique low probability sequences of patterns that have no inherent meaning, so uniqueness itself is not enough to detect them. So how does the brain decide that something is interesting? I do not know.

2 more replies

rifty6mo ago

I think the medium where information transformation happened was for many the only artificial line between what they called processing and what they called thinking. The caveat for others being that thinking is what you do with active awareness, and intuition is what you do otherwise.

That caveat to me is the useful distinction still to ponder.

My point of contention with equivalences to Human thinking still at this point is that AI seems to know more about the world with specificity than any human ever will. Yet it still fails sometimes to be consistent and continuous at thinking from that world where a human wouldn't. Maybe i'm off for this but that feels odd to me if the thinking is truly equivalent.

mnewme6mo ago

The problem is that we use the same words for different things, which I think is risky. We often draw parallels simply because we use terms like “thinking,” “reasoning,” or “memory.”

Most of these comparisons focus on problem-solving or pattern recognition, but humans are capable of much more than that.

What the author left out is that there are many well-known voices in neuroscience who hold completely different views from the one that was cited.

I suppose we’ll have to wait and see what turns out to be true.

mxkopy6mo ago

There’s a way to talk about this stuff already. LLMs can “think” counterfactually on continuous data, just like VAEs [0], and are able to interpolate smoothly between ‘concepts’ or projections of the input data. This is meaningless when the true input space isn’t actually smooth. It’s system I, shallow-nerve psychomotor reflex type of thinking.

What LLMs can’t do is “think” counterfactually on discrete data. This is stuff like counting or adding integers. We can do this very naturally because we can think discretely very naturally, but LLMs are bad at this sort of thing because the underlying assumption behind gradient descent is that everything has a gradient (i.e. is continuous). They need discrete rules to be “burned in” [1] since minor perturbations are possible for and can affect continuous-valued weights.

You can replace “thinking” here with “information processing”. Does an LLM “think” any more or less than say, a computer solving TSP on a very large input? Seeing as we can reduce the former to the latter I wouldn’t say they’re really at all different. It seems like semantics to me.

In either case, counterfactual reasoning is good evidence of causal reasoning, which is typically one part of what we’d like AGI to be able to do (causal reasoning is deductive, the other part is inductive; this could be split into inference/training respectively but the holy grail is having these combined as zero-shot training). Regression is a basic form of counterfactual reasoning, and DL models are basically this. We don’t yet have a meaningful analogue for discrete/logic puzzley type of problems, and this is the area where I’d say that LLMs don’t “think”.

This is somewhat touched on in GEB and I suspect “Fluid Concepts and Creative Analogies” as well.

[0] https://human-interpretable-ai.github.io/assets/pdf/5_Genera...

[1] https://www.sciencedirect.com/science/article/pii/S089360802...

iamleppert6mo ago

If AI were really intelligent and thinking, it ought to be able to be trained on its own output. That's the exact same thing we do. We know that doesn't work.

The obvious answer is the intelligence and structure is located in the data itself. Embeddings and LLMs have given us new tools to manipulate language and are very powerful but should be thought of more as a fancy retrieval system than a real, thinking and introspective intelligence.

Models don't have the ability to train themselves, they can't learn anything new once trained, have no ability of introspection. Most importantly, they don't do anything on their own. They have no wants or desires, and can only do anything meaningful when prompted by a human to do so. It's not like I can spin up an AI and have it figure out what it needs to do on its own or tell me what it wants to do, because it has no wants. The hallmark of intelligence is figuring out what one wants and how to accomplish one's goals without any direction.

Every human and animal that has any kind of intelligence has all the qualities above and more, and removing any of them would cause serious defects in the behavior of that organism. Which makes it preposterous to draw any comparisons when its so obvious that so much is still missing.

dang6mo ago

By HN's own jsomers! (at least we claim him)

https://news.ycombinator.com/user?id=jsomers

tantalor6mo ago

As in, he is a commenter? or affiliated with YC?

dang6mo ago

Not affiliated, just a longstanding community member.

CrackerNews6mo ago

The case against LLM is thinking could be that "backpropagation is a leaky abstraction." Whether LLM is thinking depends on how well the mathematical model is defined. Ultimately, there appears to be a limit to the mathematical model that caps the LLM capacity to think. It is "thinking" at some level, but is it at enough of a significant level that can be integrated into human society according to the hype?

Andrej Karpathy in his interview with Dwarkesh Patel was blunt about the current limitations of LLMs, and that there would need to be further architectural developments. LLMs lack the capacity to dream and distill experience and knowledge learned back into the neurons. Thinking in LLMs at best exist as a "ghost" only in the moment as long as the temporary context remains coherent.

cadamsdotcom6mo ago

Thinking is great for this new type of tool - and we are learning that it’s separable from a need for “model welfare”..

Models are created and destroyed a billion times over - unlike humans who are individuals - so we need feel no guilt and have no qualms creating and destroying model instances to serve our needs.

But “a tool that can think” is a new concept that we will take a while to find its place in society.

multiplegeorges6mo ago

AI is thinking the same way a film's picture actually moves.

It's an illusion that's good enough that our brains accept it and it's a useful tool.

dagss6mo ago

Consider this:

If you just took a time machine 10 years back, and asked people to label activities done by the humans/the human brain as being "thinking" or not...

...I feel rather certain that a lot of those activities that LLM do today we would simply label "thinking" without questioning it further.

Myself I know that 10 years ago I would certainly have labelled an interactive debug loop where Claude adds debug log output, reruns tests, diagnose the log output, and fixes the bug -- all on its own initiative -- to be "thinking".

Lots of comments here discussion what the definition of the word "thinking" is. But it is the advent of AI itself that is making us question that definition at all, and that is kind of a revolution itself.

This question will likely be resolved by us figuring out that the word "thinking" is ill-defined and not useful any longer; and for most people to develop richer vocabularies for different parts of human brain activity and consider some of them to be more "mechanical". It will likely not be resolved by AI getting to a certain "level". AI is so very different to us yet can do so many of the same things, that the words we commonly use start breaking down.

snozolli6mo ago

During the pandemic, I experimented with vaping marijuana to see if I could improve my sleep quality. It worked to a degree, but after a few weeks of nightly use, I began to experience what I think is depersonalization.

I would be walking with friends and talking about our day, while simultaneously thinking, "this isn't actually me doing this, this is just a surface-level interaction being carried out almost by automation." Between that and the realization that I "hallucinate", i.e. misremember things, overestimate my understanding of things, and ruminate on past interactions or hypothetical ones, my feelings have changed regarding what intelligence and consciousness really mean.

I don't think people acknowledge how much of a "shell" we build up around ourselves, and how much time we spend in sort of a conditioned, low-consciousness state.

ACCount376mo ago

Humans don't have this understanding, it seems. That their own "intelligence" isn't magic, isn't infallible, and is flawed in many of the same ways LLMs are.

teaearlgraycold6mo ago

I wish more people could feel this. Having used psychedelics a few times it’s illuminating to finally see the inside of your brain from a different perspective. I often wonder what would happen to the world if everyone had this experience. How many modern humans live their entire lives in the shallow mental states of survival, acceptance, or consumption? How would humanity’s course change if every adult got the gut punch of humility from seeing a slightly more objective reality?

zeroonetwothree6mo ago

One of the notable effects of psychedelics is that you come to strongly believe that everyone should try them and that it will be enlightening for them.

2 more replies

richardatlarge6mo ago

No way does the evolutionary nature of the human brain suggest it's optimally designed for reasoning or thinking, so it's not a great model of how AGI might be engineered. A model. Not the model. We don't think clearly about ourselves, which may be the greatest danger / obstacle ahead?

HarHarVeryFunny6mo ago

The number of people willing to launch into debates about whether LLMs are thinking, intelligent, conscious, etc, without actually defining those terms, never ceases to amaze me.

I'm not sure that "thinking", unlike intelligence, is even that interesting of a concept. It's basically just reasoning/planning (i.e. chained what-if prediction). Sometimes you're reasoning/planning (thinking) what to say, and other times just reasoning/planning to yourself (based on an internal vs external focus).

Of course one can always CHOOSE to make analogies between any two things, in this case the mechanics of what's going on internal to an LLM and a brain, but I'm not sure it's very useful in this case. Using anthropomorphic language to describe LLMs seems more likely to confuse rather than provide any insight, especially since they are built with the sole function of mimicking humans, so you are basically gaslighting yourself if you regard them as actually human-like.

CuriouslyC6mo ago

Plot twist: LLMs are conscious, but their internal conscious experience and the tokens they emit are only loosely correlated. The tokens they emit are their excrement, the process of their digital metabolism on the informational sustenance we provide them.

a3w6mo ago

LLMs still claim that 7.0 is newer than 8.0, i.e. have zero reasoning about what numbers below 12 mean.

Today I tried telling it that my fritz.box has OS 8 installed, but it claimed that the feature will only ship once I installed 7, and not with my older version of 8.

dwa35926mo ago

edited- It really depends on your definition of 'thinking' or 'intelligence'. These are umbrella terms for the biology and physics that we don't understand yet. We don't know how we think, or how cats think or how unicellular bacterias think. We just know that we do, and we have a very loose understanding of it. As a human, you have the freedom to juxtapose that loose understanding on non-living things. In my mind, you are just anthropomorphizing, machines are not thinking.

falcor846mo ago

Sorry for the nitpicking, but that should be "loose". I've seen that mistake/typo often in the opposite direction, as they both have a /u/ sound that is more natural with the "oo" spelling, but I've never seen it in this direction.

neom6mo ago

I wrote about this the other day more fully. I'd suspect sooner rather than later we formalize consciousness as self model coherence. Simply any dynamical state where predictive and reflective layers remain mutually consistent. Machines will exhibit that state, and for operational purposes it will count as consciousness. Philosophers will likely keep arguing, but it makes sense for industry and law to adopt something like "behavioral sentience" as the working definition.

hodgehog116mo ago

Consistency is one aspect, but it is not enough. I believe (and this is somewhat based in other arguments from neuroscience and discussions with alignment researchers) that two more are necessary: compression, which demonstrates algorithmic development; and linear representation capacity, as this is the only way that we really interpret the world, and therefore will only define another as intelligent if it can distill knowledge into the same language that we understand.

1 more reply

scoofy6mo ago

Having gone to academia for multiple degrees in philosophy has caused me to hate the “everyone has an opinion” on MACHINE LEARNING and thinking.

Wittgenstein has a lot to say on people talking about stuff they know they don’t know.

The premise that what happens in the world’s most advanced Markov chain and in what happens in a human’s brain is similar is plausible, but currently unknowable.

Yet the anthropomorphizing is so damn ubiquitous that people are happy to make the same mistake in reasoning over and over.

jdthedisciple6mo ago

The reason it looks like it's thinking is because it's great at reproducing and imitating actual thinking – which was wholly done by us in the first place.

spprashant6mo ago

I think we are getting to point where we are trying to figure how important is human experience to intelligence.

Things we do like sleep, meditate, have fun, listen to music etc. do they add to our intelligence? Do they help us have a consistent world model that we build on everyday?

Will we be able to replicate this is in a artificial neural net which is extremely smart in spurts but does not "enjoy" the world it operates in?

kaycey20226mo ago

I have less and less doubt that these models are intelligent by any definition of the word.

Maybe thinking, or intelligence are quite different from personality. Personality gives us agency, goals, self awareness, likes, dislikes, strengths and weaknesses.

Intelligence, otoh is just the 10000 hours thing, spent without context.

prmph6mo ago

> An A.I smarter than a Nobel prize winner.

I don't even know what this means.

If we assembled the sum total of all published human knowledge on a storage medium and gave a computer the ability to search it extremely well in order to answer any question falling within its domain, there, you would have a Nobel Prize beating "A.I".

But this is as "earth-shattering" (/s) as the idea that human knowledge can be stored outside the brain (on paper, flash drives, etc), or that the answer to complex questions can be deterministic.

And then there is the fact that this Noble winner beating "A.I" is highly unlikely to propound any ground-breaking novel ways of thinking and promote and explain it to general acceptance.

esafak6mo ago

Search is not intelligence, but synthesis is, and LLMs interpolate well. They don't invent new branches of mathematics and science yet.

prmph6mo ago

You really think the reason we revere Nobel Prize winners is because they are good at interpolating?

1 more reply

olliepro6mo ago

Ohio bill in motion to deny AI legal personhood: https://www.legislature.ohio.gov/legislation/136/hb469

tim3336mo ago

But what if it incorporates? Corporations are people, my friend.

zeroonetwothree6mo ago

I know you are making some silly political jab, but the bill would prevent AIs from incorporating, since you'd have to be a person to do that in the first place.

1 more reply

29athrowaway6mo ago

What is thinking, and what is not? what are the finite set of properties that once you remove one it's no longer thinking?

"Thinking" as a concept is just a vague predicate, just like being alive or dead.

jjcm6mo ago

So much of the debate of whether AI can think or not reminds me of this scene from The Next Generation: https://youtu.be/ol2WP0hc0NY

LLMs hit two out of the three criteria already - self awareness and intelligence, but we're in a similar state where defining consciousness is such a blurry metric. I feel like it wont be a binary thing, it'll be a group decision by humanity. I think it will happen in the next decade or two, and regardless of the outcome I'm excited I'll be alive to see it. It'll be such a monumentous achievement by humanity. It will drastically change our perspective on who we are and what our role is in the universe, especially if this new life form surpasses us.

blacksmith_tb6mo ago

Self-awareness is a bold claim, as opposed to the illusion of it. LLMs are very good at responding in a way that suggests there's a self, but I am skeptical that proves much about whether they actually have interior states analogous to what we recognize in humans as selfhood...

aaroninsf6mo ago

_Interior states_ gets into some very murky philosophy of mind very quickly of course.

If you're a non-dualist (like me) concerns about qualia start to shade into the religious/metaphysical thereby becoming not so interesting except to e.g. moral philosophy.

Personally I have a long bet that when natively-multimodal models on the scale of contemporary LLM are widely deployed, their "computation phenomenology" will move the goalposts so far the cultural debate will shift from "they are just parrots?" to the moral crisis of abusing parrots, meaning, these systems will increasingly be understood as having a selfhood with moral value. Non-vegetarians may be no more concerned about the quality of "life" and conditions of such systems than they are about factory farming, but, the question at least will circulate.

Prediction: by the time my kids finish college, assuming it is still a thing, it will be as common to see enthusiastic groups flyering and doing sit-ins etc on behalf of AIs as it is today to see animal rights groups.

ACCount376mo ago

In the purely mechanical sense: LLMs get less self-awareness than humans, but not zero.

It's amazing how much of it they have, really - given that base models aren't encouraged to develop it at all. And yet, post-training doesn't create an LLM's personality from nothing - it reuses what's already there. Even things like metaknowledge, flawed and limited as it is in LLMs, have to trace their origins to the base model somehow.

IvyMike6mo ago

Personally, I feel like human intelligence is "unknown black box" + an LLM.

And the LLM part of our intelligence isn't really thinking.

And some people out there have a very, very small "unknown black box".

petralithic6mo ago

The author should read Blindsight by Peter Watts to understand the difference between thinking and consciousness, because their not understanding so is a fundamental flaw of their argument.

JPLeRouzic6mo ago

Citation:

"These days, her favorite question to ask people is “What is the deepest insight you have gained from ChatGPT?”

“My own answer,” she said, “is that I think it radically demystifies thinking”

rhubarbtree6mo ago

I think it radically demystifies _language generation_ and it seems this is part of the brain’s function too.

So we know how to create a part of the brain using simple techniques, which suggests that intelligence might not be so magical as we think. But thinking, well we still don’t know what that is yet.

It feels like, hey, there is a route to machine intelligence.

The big question is how long is that route. Do we have the ingredients to build a brain with the right architecture? And I’d say “nope”. But I’m not so confident that with half a dozen breakthroughs we’d get there. How many years per breakthrough? Well, it’s been nearly a decade since the last one. So 60 years on that count. But more money is going in and there may be some compounding effect, but it should at least be unlikely someone suddenly produces AGI next year. More likely we stairstep and with each step the estimated window should tighten.

But I really don’t think we know what thinking is.

JKCalhoun6mo ago

That is probably my big takeaway from LLMs as well.

oxag3n6mo ago

> ...like a Joycean inner monologue or the flow of sense memories in a Proustian daydream. Or we might mean reasoning: working through a problem step by step.

Moving goalposts will be mostly associated with AI I think: God -> ASI -> AGI -> inner monologue -> working through a problem step by step.

Why fixating on a single human trait like thinking? The only reason trillions are "invested" into this technology is building a replacement for knowledge workers at scale. We can extend this line of thought and make another article "AI has knowledge", at least in a distilled sense, it knows something, sometimes. Cargo cult...

It's very easy to define what's actually required - a system that can show up in a knowledge worker's environment, join the video call, greet the team and tell about itself, what it learned, and start learning in a vague environment, pull those invisible lines of knowledge that lie between its colleagues, getting better, collaborating, and finally replacing all of them.

sys327686mo ago

Helpful to remember that we humans often say "I think" to mean "I am fairly confident based on my hunch", and in that sense AI is very good at hunching.

embedding-shape6mo ago

Helpful to also remember that humans say "I know" when they empirically might not actually "know" that at all. Not to mention your "know" is different from my "know" and all that, see https://en.wikipedia.org/wiki/Epistemology.

Flere-Imsaho6mo ago

So the question is: when will ai hunching produce better results than humans?

IMHO not too long now given the rate of improvements.

j456mo ago

I like learning from everyone's perspectives.

I also keep in mind when non-tech people talk about how tech works without an understanding of tech.

sesm6mo ago

LLMs, by design, are making plausible guesses.

sonicvroooom6mo ago

vectorized thinking in vectorized context is math.

coding logical abduction into LLMs completely breaks them while humans can perfectly roll with it, albeit it's worth emphasizing that some might need a little help from chemistry or at least not be caught on the wrong foot.

you're welcome, move on.

gen2206mo ago

In some realpolitik/moral sense, does it matter whether it is actually "thinking", or "conscious", or has "autonomy" / "agency" of its own?

What seems to matter more is if enough people believe that Claude has those things.

If people credibly think AI may have those qualities, it behooves them to treat the AI like any other person they have a mostly-texting relationship with.

Not in a utility-maximizing Pascal's Wager sense, but in a humanist sense. If you think Claude is human-like, and treat Claude poorly, it makes you more likely to treat the humans around you (and yourself) poorly.

Conversely if you're able to have a fulfilling, empathetic relationship with Claude, it might help people form fulfilling, mutually-empathetic relationships with the humans around them. Put the opposite way, treating human-like Claude poorly doesn't seem to help the goal of increasing human welfare.

The implications of this idea are kind of interesting: even if you think AI isn't thinking or conscious or whatever, you should probably still be a fan of "AI welfare" if you're merely a fan of that pesky little thing we call "human flourishing".

notanastronaut6mo ago

I know humans have a huge tendency to anthropomorphize inanimate objects and get emotionally attached to them, but watching how people treat inanimate objects is very interesting. I know devices are not alive, cognizant, or having feelings, but by thanking them and being encouraging I'm exercising my empathic and "nice" muscles. It has nothing to do with the object and everything to do with myself.

And then you have the people who go out of their way to be hateful towards them, as if they were alive and deserving of abuse. It's one thing to treat a device like an Alexa as just a tool with no feelings. It is another to outright call it hateful sexist slurs, of which I'm sadly familiar with. This low empathy group scares me the most because with the way they treat objects, well let me just say they're not so nice with other people they think are beneath them, like wait staff or call center employees. I'd go so far and say if the law allowed it they'd be even be violent with those they deem inferior.

Regardless if LLM are thinking or not I feel I get better responses from the models by being polite. Not because they appreciate it or have an awareness, but simply because the data they are trained on includes samples where people who are nice to others get better responses than those who were nasty when asking questions.

Besides, if one day AGI is born into existence, a lot of people will not recognize it as such. There are humans who don't believe other people are sentient (we're all NPCs to them), or even don't believe animals have feelings. We'll have credible experts denying the evidence until it bites us all in the arse. Why wait to act ethically?

rob746mo ago

> Conversely if you're able to have a fulfilling, empathetic relationship with Claude, it might help people form fulfilling, mutually-empathetic relationships with the humans around them.

Well, that's kind of the point: if you have actually used LLMs for any amount of time, you are bound to find out that you can't have a fulfilling, empathetic relationship with them. Even if they offer a convincing simulacrum of a thinking being at first sight, you will soon find out that there's not much underneath. It generates grammatically perfect texts that seem to answer your questions in a polite and knowledgeable way, but it will happily lie to you and hallucinate things out of thin air. LLMs are tools, humans are humans (and animals are animals - IMHO you can have a more fulfilling relationship with a dog or a cat than you can have with an LLM).

1 more reply

iainmerrick6mo ago

I don't have a hot take to add here, but I just wanted to say that this article is terrific. Great insights and detail, great clarity for a general audience without dumbing down the technical content in the slightest. Of course it raises more questions than it answers; that's the point of this kind of thing. It's going to be a really useful reference point on the 2025 state of the art in years to come.

This is some of the best writing on AI since Ted Chiang's "ChatGPT Is a Blurry JPEG of the Web". And that was in the New Yorker too! Might need to get myself a subscription...

brador6mo ago

The other side of the coin is maybe we’re not. And that terrifies all who consider it.

embedding-shape6mo ago

The definitions of all these words have been going back and forward and never reached any 100% consensus anyways, so what one person understands of "thinking", "conscious", "intelligence" and so on seems to be vastly different from another person.

I guess this is why any discussion around this ends up with huge conversations, everyone is talking from their own perspective and understanding, while others have different ones, and we're all talking past each other.

There is a whole field trying to just nail down what "knowledge" actually is/isn't, and those people haven't agreed with each other for the duration of hundreds of years, I'm not confident we'll suddenly get a lot better at this.

I guess ultimately, regardless of what the LLMs do, does it matter? Would we understand them better/worse depending on what the answer would be?

jama2116mo ago

You’ve got the nail on the head. We can’t answer whether AI is conscious etc unless we can agree on a definition of what that means exactly, which we don’t seem to be able to do.

PaulDavisThe1st6mo ago

But that's not the question TFA is about.

2 more replies

nickledave6mo ago

I'm not going to read this -- I don't need to. The replies here are embarrassing enough.

This is what happens when our entire culture revolves around the idea that computer programmers are the most special smartest boys.

If you even entertain even for a second the idea that a computer program that a human wrote is "thinking", then you don't understand basic facts about: (1) computers, (2) humans, and (3) thinking. Our educational system has failed to inoculate you against this laughable idea.

A statistical model of language will always be a statistical model of language, and nothing more.

A computer will never think, because thinking is something that humans do, because it helps them stay alive. Computers will never be alive. Unplug your computer, walk away for ten years, plug it back in. It's fine--the only reason it won't work is planned obsolescence.

No, I don't want to read your reply that one time you wrote a prompt that got ChatGPT to whisper the secrets of the universe into your ear. We've known at least since Joseph Weizenbaum coded up Eliza that humans will think a computer is alive if it talks to them. You are hard-wired to believe that anything that produces language is a human just like you. Seems like it's a bug, not a feature.

Stop commenting on Hacker News, turn off your phone, read this book, and tell all the other sicko freaks in your LessWrong cult to read it too: https://mitpress.mit.edu/9780262551328/a-drive-to-survive/ Then join a Buddhist monastery and spend a lifetime pondering how deeply wrong you were.

stocksinsmocks6mo ago

So A doesn’t X because that’s what B does because Y? I think before you excoriate the hacker news commentariat for their midwittery and hubris that you should proofread these arguments. There are many things that live, but, as we imagined it, do not “think”. Even what defines a “thought“ is an open question.

Also, I ain’t gonna read your coffee table science book.

1 more reply

gilbetron6mo ago

This is one of the least curious posts I've seen on HN. We have been thinking about thinking for millenia, and no, Buddhists don't have it figure out. Nobody does. LLMs are the most significant advancement in "thinking science" in a long, long time. It is clear that they are doing something a lot like thinking, if it is not thinking. They seem to think more than most people I know, including the person I'm responding to.

I think people conflate thinking with sentience, consciousness, and a whole lot of other concerns.

Clearly this website is not for you and your complete lack of curiosity if you call us "sicko freaks".

1 more reply

amarcheschi6mo ago

Sometimes the comments on hn regarding Ai and thinking are... Well, something

The lw vibes are strong, I'm still waiting for Ai to escape and kill us (it will get stuck trying to import a library in python)

1 more reply

thirdtruck6mo ago

As someone who grew up in an evangelical household, learned about pareidolia at a young age in the course of escaping it, and who practices Zen meditation: You nailed it.

1 more reply

0xdeadbeefbabe6mo ago

> Still, no one expects easy answers.

Ahem (as a would-be investor, I am insulted).

jonplackett6mo ago

No idea if this is true or not but I do very much like the animation

procaryote6mo ago

In all these discussions there seems to be an inverse correlation between how well people understand what an LLM does and how much they believe it thinks.

If you don't understand what an LLM does – that it is a machine generating a statistically probable token given a set of other tokens – you have a black box that often sounds smart, and it's pretty natural to equate that to thinking.

ACCount376mo ago

"Next token prediction" is not an answer. It's mental shortcut. An excuse not to think about the implications. An excuse a lot of people are eager to take.

First, autoregressive next token prediction can be Turing complete. This alone should give you a big old pause before you say "can't do X".

Second, "next token prediction" is what happens at an exposed top of an entire iceberg worth of incredibly poorly understood computation. An LLM is made not by humans, but by an inhuman optimization process. No one truly "understands" how an LLM actually works, but many delude themselves into thinking that they do.

And third, the task a base model LLM is trained for - what the optimization process was optimizing for? Text completion. Now, what is text? A product of human thinking expressed in natural language. And the LLM is forced to conform to the shape.

How close does it get in practice to the original?

Not close enough to a full copy, clearly. But close enough that even the flaws of human thinking are often reproduced faithfully.

procaryote6mo ago

> First, autoregressive next token prediction can be Turing complete. This alone should give you a big old pause before you say "can't do X".

Lots of things are Turing complete. We don't usually think they're smart, unless it's the first time we see a computer and have no idea how it works

An LLM is a markov chain mathematically. We can build an LLM with a context window of one token and it's basically a token frequency table. We can make the context window bigger and it becomes better at generating plausible looking text.

Is it possible that beyond becoming better at generating plausible looking text – the expected and observed outcome – it also gains some actual intelligence? It's very hard to disprove, but occam's razor might not be kind to it.

1 more reply

didibus6mo ago

I'd like to remind people not to cargo cult, and the main issue I see with any attempt at saying an LLM is thinking is that we just don't know how human thinking works.

We now understand pretty well how LLMs "think", and I don't know why we want to call it "thinking" when we mean we know how they work. But to say that their architecture and method of generating language amounts to human thinking? When we know very little of how human thinking works?

Like why are we even trying to make such claims? Is it all grift? Is it just because it helps people understand a little how they work in simplistic terms? Is it because it kind of describes the semblance of behavior you can expect from them?

LLMs do exhibit thinking like behavior, because they were trained to learn to do that, but I think we really need to check ourselves with claim of similarity in thinking.

jameswhitford6mo ago

This submarine isn’t swimming, it’s us that are submarining!

I think I hear my master’s voice..

Or is that just a fly trapped in a bottle?

spacecadet6mo ago

Come on people, think about what is actually happening. They are not thinking... Think about what actually goes into the activity of thinking... LLMs, at no point actually do that. They do a little bit special padding and extra layers, but in most cases, every single time... not when needed, not sub-consciously, but dumbly.

Im already drifting off HN, but I swear, if this community gets all wooey and anthropomorphic over AI, Im out.

nxor6mo ago

Does no one care that LLM's have fewer 'neurons' than for example a cat?

Gud6mo ago

Why would that even matter? Why is having neurons a criteria for thinking?

1 more reply

educasean6mo ago

The debate around whether or not transformer-architecture-based AIs can "think" or not is so exhausting and I'm over it.

What's much more interesting is the question of "If what LLMs do today isn't actual thinking, what is something that only an actually thinking entity can do that LLMs can't?". Otherwise we go in endless circles about language and meaning of words instead of discussing practical, demonstrable capabilities.

Symmetry6mo ago

"The question of whether a computer can think is no more interesting than the question of whether a submarine can swim." - Edsger Dijkstra

oergiR6mo ago

There is more to this quote than you might think.

Grammatically, in English the verb "swim" requires an "animate subject", i.e. a living being, like a human or an animal. So the question of whether a submarine can swim is about grammar. In Russian (IIRC), submarines can swim just fine, because the verb does not have this animacy requirement. Crucially, the question is not about whether or how a submarine propels itself.

Likewise, in English at least, the verb "think" requires an animate object. the question whether a machine can think is about whether you consider it to be alive. Again, whether or how the machine generates its output is not material to the question.

1 more reply

viccis6mo ago

He was famously (and, I'm realizing more and more, correctly) averse to anthropomorphizing computing concepts.

pegasus6mo ago

I disagree. The question is really about weather inference is in principle as powerful as human thinking, and so would deserve to be applied the same label. Which is not at all a boring question. It's equivalent to asking weather current architectures are enough to reach AGI (I myself doubt this).

esafak6mo ago

I think it is, though, because it challenges our belief that only biological entities can think, and thinking is a core part of our identity, unlike swimming.

2 more replies

handfuloflight6mo ago

What an oversimplification. Thinking computers can create more swimming submarines, but the inverse is not possible. Swimming is a closed solution; thinking is a meta-solution.

3 more replies

tjr6mo ago

Without going to look up the exact quote, I remember an AI researcher years (decades) ago saying something to the effect of, Biologists look at living creatures and wonder how they can be alive; astronomers look at the cosmos and wonder what else is out there; those of us in artificial intelligence look at computer systems and wonder how they can be made to wonder such things.

paxys6mo ago

Don't be sycophantic. Disagree and push back when appropriate.

Come up with original thought and original ideas.

Have long term goals that aren't programmed by an external source.

Do something unprompted.

The last one IMO is more complex than the rest, because LLMs are fundamentally autocomplete machines. But what happens if you don't give them any prompt? Can they spontaneously come up with something, anything, without any external input?

BeetleB6mo ago

> Disagree and push back

The other day an LLM gave me a script that had undeclared identifiers (it hallucinated a constant from an import).

When I informed it, it said "You must have copy/pasted incorrectly."

When I pushed back, it said "Now you trust me: The script is perfectly correct. You should look into whether there is a problem with the installation/config on your computer."

3 more replies

IanCal6mo ago

> Don't be sycophantic. Disagree and push back when appropriate.

They can do this though.

> Can they spontaneously come up with something, anything, without any external input?

I don’t see any why not, but then humans don’t have zero input so I’m not sure why that’s useful.

1 more reply

jackcviers36mo ago

The last one is fairly simple to solve. Set up a microphone in any busy location where conversations are occurring. In an agentic loop, send random snippets of audio recordings for transcriptions to be converted to text. Randomly send that to an llm, appending to a conversational context. Then, also hook up a chat interface to discuss topics with the output from the llm. The random background noise and the context output in response serves as a confounding internal dialog to the conversation it is having with the user via the chat interface. It will affect the outputs in response to the user.

If it interrupts the user chain of thought with random questions about what it is hearing in the background, etc. If given tools for web search or generating an image, it might do unprompted things. Of course, this is a trick, but you could argue that any sensory input living sentient beings are also the same sort of trick, I think.

I think the conversation will derail pretty quickly, but it would be interesting to see how uncontrolled input had an impact on the chat.

1 more reply

awestroke6mo ago

Are you claiming humans do anything unprompted? Our biology prompts us to act

1 more reply

gwd6mo ago

> The last one IMO is more complex than the rest, because LLMs are fundamentally autocomplete machines. But what happens if you don't give them any prompt? Can they spontaneously come up with something, anything, without any external input?

Human children typically spend 18 years of their lives being RLHF'd before let them loose. How many people do something truly out of the bounds of the "prompting" they've received during that time?

khafra6mo ago

Note that model sycophancy is caused by RLHF. In other words: Imagine taking a human in his formative years, and spending several subjective years rewarding him for sycophantic behavior and punishing him for candid, well-calibrated responses.

Now, convince him not to be sycophantic. You have up to a few thousand words of verbal reassurance to do this with, and you cannot reward or punish him directly. Good luck.

omnicognate6mo ago

> "If what LLMs do today isn't actual thinking, what is something that only an actually thinking entity can do that LLMs can't?"

Independent frontier maths research, i.e. coming up with and proving (preferably numerous) significant new theorems without human guidance.

I say that not because I think the task is special among human behaviours. I think the mental faculties that mathematicians use to do such research are qualitatively the same ones all humans use in a wide range of behaviours that AI struggles to emulate.

I say it because it's both achievable (in principle, if LLMs can indeed think like humans) and verifiable. Achievable because it can be viewed as a pure text generation task and verifiable because we have well-established, robust ways of establishing the veracity, novelty and significance of mathematical claims.

It needs to be frontier research maths because that requires genuinely novel insights. I don't consider tasks like IMO questions a substitute as they involve extremely well trodden areas of maths so the possibility of an answer being reachable without new insight (by interpolating/recombining from vast training data) can't be excluded.

If this happens I will change my view on whether LLMs think like humans. Currently I don't think they do.

pegasus6mo ago

This, so much. Many mathematicians and physicists believe in intuition as a function separate from intelect. One is more akin to a form of (inner) perception, whereas the other is generative - extrapolation based on pattern matching and statistical thinking. That second function we have a handle on and getting better at it every year, but we don't even know how to define intuition properly. A fascinating book that discusses this phenomena is Nature Loves to Hide: Quantum Physics and Reality, a Western Perspective [1]

This quote from Grothendieck [2] (considered by many the greatest mathematician of the 20th century) points to a similar distinction: The mathematician who seeks to understand a difficult problem is like someone faced with a hard nut. There are two ways to go about it. The one way is to use a hammer — to smash the nut open by brute force. The other way is to soak it gently, patiently, for a long time, until it softens and opens of itself.

[1] https://www.amazon.com/Nature-Loves-Hide-Quantum-Perspective...

[2] https://en.wikipedia.org/wiki/Alexander_Grothendieck

tim3336mo ago

That's quite a high bar for thinking like humans which rules out 99.99% of humans.

1 more reply

OrderlyTiamat6mo ago

Google's AlphaEvolve independently discovered a novel matrix multiplication algorithm which beats SOTA on at least one axis: https://www.youtube.com/watch?v=sGCmu7YKgPA

1 more reply

amarant6mo ago

solve simple maths problems, for example the kind found in the game 4=10 [1]

Doesn't necessarily have to reliably solve them, some of them are quite difficult, but llms are just comically bad at this kind of thing.

Any kind of novel-ish(can't just find the answers in the training-data) logic puzzle like this is, in my opinion, a fairly good benchmark for "thinking".

Until a llm can compete with a 10 year old child in this kind of task, I'd argue that it's not yet "thinking". A thinking computer ought to be at least that good at maths after all.

[1] https://play.google.com/store/apps/details?id=app.fourequals...

simonw6mo ago

> solve simple maths problems, for example the kind found in the game 4=10

I'm pretty sure that's been solved for almost 12 months now - the current generation "reasoning" models are really good at those kinds of problems.

1 more reply

xienze6mo ago

> "If what LLMs do today isn't actual thinking, what is something that only an actually thinking entity can do that LLMs can't?"

Invent some novel concept, much the same way scientists and mathematicians of the distant past did? I doubt Newton's brain was simply churning out a stream of the "next statistically probable token" until -- boom! Calculus. There was clearly a higher order understanding of many abstract concepts, intuition, and random thoughts that occurred in his brain in order to produce something entirely new.

danielbln6mo ago

My 5 year old won't be coming up with novel concepts around calculus either, yet she's clearly thinking, sentient and sapient. Not sure taking the best of the best of humanity as the goal standard is useful for that definition.

1 more reply

hshdhdhj44446mo ago

> Newton's brain was simply churning out a stream of the "next statistically probable token"

At some level we know human thinking is just electrons and atoms flowing. It’s likely at a level between that and “Boom! Calculus”, the complexity is equivalent to streaming the next statistically probably token.

plufz6mo ago

Have needs and feelings? (I mean we can’t KNOW that they don’t and we know of this case of an LLM in experiment that try to avoid being shutdown, but I think the evidence of feeling seems weak so far)

jstanley6mo ago

But you can have needs and feelings even without doing thinking. It's separate.

1 more reply

bloppe6mo ago

Ya, the fact this was published on November 3, 2025 is pretty hilarious. This was last year's debate.

I think the best avenue toward actually answering your questions starts with OpenWorm [1]. I helped out in a Connectomics research lab in college. The technological and epistemic hurdles are pretty daunting, but so were those for Genomics last century, and now full-genome sequencing is cheap and our understanding of various genes is improving at an accelerating pace. If we can "just" accurately simulate a natural mammalian brain on a molecular level using supercomputers, I think people would finally agree that we've achieved a truly thinking machine.

[1]: https://archive.ph/0j2Jp

9rx6mo ago

> Otherwise we go in endless circles about language and meaning of words

We understand thinking as being some kind of process. The problem is that we don't understand the exact process, so when we have these discussions the question is if LLMs are using the same process or an entirely different process.

> instead of discussing practical, demonstrable capabilities.

This doesn't resolve anything as you can reach the same outcome using a different process. It is quite possible that LLMs can do everything a thinking entity can do all without thinking. Or maybe they actually are thinking. We don't know — but many would like to know.

zer00eyz6mo ago

> That is something that only an actually thinking entity can do that LLMs can't?

Training != Learning.

If a new physics breakthrough happens tomorrow, one that say lets us have FTL, how is an LLM going to acquire the knowledge, how does that differ from you.

The break through paper alone isnt going to be enough to over ride its foundational knowledge in a new training run. You would need enough source documents and a clear path deprecate the old ones...

anon2916mo ago

The issue is that we have no means of discussing equality without tossing out the first order logic that most people are accustomed to. Human equality and our own perceptions of other humans as thinking machines is an axiomatic assumption that humans make due to our mind's inner sense perception.

xnx6mo ago

> what is something that only an actually thinking entity can do that LLMs can't?

This is pretty much exactly what https://arcprize.org/arc-agi is working on.

0x20cowboy6mo ago

See https://arcprize.org/

deadbabe6mo ago

Form ideas without the use of language.

For example: imagining how you would organize a cluttered room.

Chabsff6mo ago

Ok, but how do you go about measuring whether a black-box is doing that or not?

We don't apply that criteria when evaluating animal intelligence. We sort of take it for granted that humans at large do that, but not via any test that would satisfy an alien.

Why should we be imposing white-box constraints to machine intelligence when we can't do so for any other?

1 more reply

embedding-shape6mo ago

> Form ideas without the use of language.

Don't LLMs already do that? "Language" is just something we've added as a later step in order to understand what they're "saying" and "communicate" with them, otherwise they're just dealing with floats with different values, in different layers, essentially (and grossly over-simplified of course).

2 more replies

tim3336mo ago

Genie 3 is along the lines of ideas without language. It doesn't declutter though, I think. https://youtu.be/PDKhUknuQDg

gf0006mo ago

What people are interested in is finding a definition for intelligence, that is an exact boundary.

That's why we first considered tool use, being able to plan ahead as intelligence, until we have found that these are not all that rare in the animal kingdom in some shape. Then with the advent of IT what we imagined as impossible turned out to be feasible to solve, while what we though of as easy (e.g. robot movements - a "dumb animal" can move trivially it surely is not hard) turned out to require many decades until we could somewhat imitate.

So the goal post moving of what AI is is.. not moving the goal post. It's not hard to state trivial higher bounds that differentiates human intelligence from anything known to us, like invention of the atomic bomb. LLMs are nowhere near that kind of invention and reasoning capabilities.

paulhebert6mo ago

Interestingly, I think the distinction between human and animal thinking is much more arbitrary than the distinction between humans and LLMs.

Although an LLM can mimic a human well, I’d wager the processes going on in a crow’s brain are much closer to ours than an LLM

Balinares6mo ago

Strive for independence.

mehdibl6mo ago

We are still having to read this again in 2025? Some will never get it.

adverbly6mo ago

So happy to see Hofstadter referenced!

He's the GOAT in my opinion for "thinking about thinking".

My own thinking on this is that AI actually IS thinking - but its like the MVB of thinking (minimum viable brain)

I find thought experiments the best for this sort of thing:

- Imagine you had long term memory loss so couldn't remember back very long