Better HN

0 points · stacktrace · 3mo ago · 0 comments

> It's still a big issue that the models will make up plausible sounding but wrong or misleading explanations for things, and verifying their claims ends up taking time. And if it's a topic you don't care about enough, you might just end up misinformed.

Exactly! One important thing LLMs have made me realise deeply is "No information" is better than false information. The way LLMs pull out completely incorrect explanations baffles me - I suppose that's expected since in the end it's generating tokens based on its training and it's reasonable it might hallucinate some stuff, but knowing this doesn't ease any of my frustration.

IMO if LLMs need to focus on anything right now, they should focus on better grounding. Maybe even something like a probability/confidence score might make the experience so much better for so many users like me.

biofox 3mo ago

I ask for confidence scores in my custom instructions / prompts, and LLMs do surprisingly well at estimating their own knowledge most of the time.

EastLondonCoder 3mo ago

I’m with the people pushing back on the “confidence scores” framing, but I think the deeper issue is that we’re still stuck in the wrong mental model.

It’s tempting to think of a language model as a shallow search engine that happens to output text, but that metaphor doesn’t actually match what’s happening under the hood. A model doesn’t “know” facts or measure uncertainty in a Bayesian sense. All it really does is traverse a high‑dimensional statistical manifold of language usage, trying to produce the most plausible continuation.

That’s why a confidence number that looks sensible can still be as made up as the underlying output, because both are just sequences of tokens tied to trained patterns, not anchored truth values. If you want truth, you want something that couples probability distributions to real world evidence sources and flags when it doesn’t have enough grounding to answer, ideally with explicit uncertainty, not hand‑waviness.

People talk about hallucination like it’s a bug that can be patched at the surface level. I think it’s actually a feature of the architecture we’re using: generating plausible continuations by design. You have to change the shape of the model or augment it with tooling that directly references verified knowledge sources before you get reliability that matters.
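A minimal sketch of the "flag when it doesn't have enough grounding" idea, assuming a hypothetical in-memory list of verified statements and a crude word-overlap score standing in for real retrieval. The names `EVIDENCE`, `support_score`, and `grounded_answer`, and the 0.5 threshold, are illustrative assumptions, not any real system's API.

```python
import re

# Hypothetical store of verified statements (stand-in for a real knowledge source).
EVIDENCE = [
    "Water boils at 100 degrees Celsius at sea level.",
    "The Python language was first released in 1991.",
]

def tokens(text: str) -> set[str]:
    """Lowercased word set; a crude stand-in for real retrieval features."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def support_score(claim: str, evidence: str) -> float:
    """Fraction of the claim's words that also appear in the evidence."""
    claim_words = tokens(claim)
    if not claim_words:
        return 0.0
    return len(claim_words & tokens(evidence)) / len(claim_words)

def grounded_answer(claim: str, threshold: float = 0.5) -> dict:
    """Return the claim with its best supporting source, or abstain."""
    best = max(EVIDENCE, key=lambda e: support_score(claim, e))
    score = support_score(claim, best)
    if score < threshold:
        # Not enough grounding: say so instead of answering.
        return {"answer": None, "source": None, "support": score}
    return {"answer": claim, "source": best, "support": score}
```

Word overlap cannot tell a claim from its negation, so this only shows the control flow: look up a source, report which one was used, and abstain when nothing supports the claim.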

kznewman 3mo ago

Solid agree. Hallucination for me IS the LLM use case. What I am looking for are ideas that may or may not be true that I have not considered and then I go try to find out which I can use and why.

coldtea 3mo ago

>A model doesn’t “know” facts or measure uncertainty in a Bayesian sense. All it really does is traverse a high‑dimensional statistical manifold of language usage, trying to produce the most plausible continuation.

And is that so different from what we do behind the scenes? Is there a difference between an actual fact and some false information stored in our brain? Or do both have the same representation in some kind of high‑dimensional statistical manifold in our brains, and we also "try to produce the most plausible continuation" using them?

There might be one major difference at a different level: what we're fed (read, see, hear, etc.) we also evaluate before storing. Does LLM training do that, beyond some kind of manually assigned crude "confidence tiers" applied to input material during training (e.g. trust Wikipedia more than Reddit threads)?

tsunamifury 3mo ago

Hallucinations are a feature of reality that LLMs have inherited.

It’s amazing that experts like yourself who have a good grasp of the manifold MoE configuration don’t get that.

LLMs, much like humans, weight high dimensionality across the entire model, then the manifold, then string together an attentive, best-weighted answer.

Just as your doctor occasionally gives you wrong advice too quickly, so does this sometimes get confused, either by lighting up too much of the manifold or by having insufficient expertise.

airstrike 3mo ago

It's not even a manifold https://arxiv.org/abs/2504.01002

wan23 3mo ago

A different way to look at it is language models do know things, but the contents of their own knowledge is not one of those things.

paulddraper 3mo ago

You have a subtle sleight of hand.

You use the word “plausible” instead of “correct.”

JAlexoid 3mo ago

I mean... That is exactly how our memory works. So in a sense, the factually incorrect information coming from an LLM is as reliable as someone telling you things from memory.

drclau 3mo ago

How do you know the confidence scores are not hallucinated as well?

kiliankoe 3mo ago

They are. The model has no inherent knowledge of its confidence levels; it just adds plausible-sounding numbers. Obviously they _can_ be plausible, but trusting them is just one more level up from trusting the original output.

I read a comment here a few weeks back that LLMs always hallucinate, but we sometimes get lucky when the hallucinations match up with reality. I've been thinking about that a lot lately.
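One way to test whether self-reported confidence numbers carry any information is to grade a batch of answers and compare stated confidence with observed accuracy. Below is a rough sketch of expected calibration error (ECE) with equal-width bins; the function name and the bin count are my own choices for illustration, and you would have to supply your own graded data.

```python
def expected_calibration_error(confidences, correct, n_bins=5):
    """Weighted average gap between stated confidence and observed accuracy per bin."""
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        # Clamp the index so that conf == 1.0 falls into the last bin.
        index = min(int(conf * n_bins), n_bins - 1)
        bins[index].append((conf, ok))
    total = len(confidences)
    ece = 0.0
    for items in bins:
        if not items:
            continue
        avg_conf = sum(c for c, _ in items) / len(items)
        accuracy = sum(1 for _, ok in items if ok) / len(items)
        # Each bin contributes in proportion to how many answers it holds.
        ece += (len(items) / total) * abs(avg_conf - accuracy)
    return ece
```

A value near 0 means the stated numbers track accuracy on that sample; a model that says 0.9 on everything and is right a quarter of the time scores about 0.65. It says nothing about why the numbers are off, only that they are.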

dfsegoat 3mo ago

They 100% are, unless you provide a RUBRIC, i.e. basically make it ordinal.

"Return a score of 0.0 if ...., Return a score of 0.5 if .... , Return a score of 1.0 if ..."
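A sketch of what a rubric-style prompt could look like in code, assuming three made-up rubric levels. The conditions in `RUBRIC` and the helper names are hypothetical placeholders (the comment above leaves the conditions elided), and the actual model call is left out on purpose.

```python
# Hypothetical rubric levels; the wording of each condition is an assumption.
RUBRIC = {
    0.0: "the answer is not supported by the provided context",
    0.5: "the answer is partly supported by the provided context",
    1.0: "the answer is fully supported by the provided context",
}

def build_rubric_prompt(question: str, answer: str) -> str:
    """Spell out every allowed score and the condition attached to it."""
    lines = [f"Question: {question}", f"Answer: {answer}", ""]
    for score, condition in sorted(RUBRIC.items()):
        lines.append(f"Return a score of {score} if {condition}.")
    lines.append("Reply with the score only.")
    return "\n".join(lines)

def parse_rubric_score(reply: str) -> float:
    """Accept only the exact rubric levels; reject anything else."""
    try:
        value = float(reply.strip())
    except ValueError:
        raise ValueError(f"not a number: {reply!r}") from None
    if value not in RUBRIC:
        raise ValueError(f"score {value} is not a rubric level")
    return value
```

Rejecting replies like 0.7 keeps the scale ordinal: the model picks one of a few described categories instead of inventing a free-form number.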

ryoshu 3mo ago

LLMs fail at causal accuracy. It's a fundamental problem with how they work.

kromokromo 3mo ago

Asking an LLM to give itself a «confidence score» is like asking a teenager to grade his own exam. LLMs don't «feel» uncertainty and confidence like we do.

robocat 3mo ago

> wrong or misleading explanations

Exactly the same issue occurs with search.

Unfortunately, not everybody knows to mistrust AI responses, or has the skills to double-check information.

darkwater 3mo ago

No, it's not the same. Search results send/show you one or more specific pages/websites. And each website has a different trust factor. Yes, plenty of people repeat things they "read on the Internet" as truths, but it's easy to debunk some of them just based on the site reputation. With AI responses, the reputation is shared with the good answers as well, because they do give good answers most of the time, but also hallucinate errors.

SebastianSosa1 3mo ago

Community notes on X seems to be one of the highest profile recent experiments trying to address this issue

incrudible 3mo ago

If somebody asks a question on Stackoverflow, it is unlikely that a human who does not know the answer will take time out of their day to completely fabricate a plausible sounding answer.

jaxn 3mo ago

People are confidently incorrect all the time. It is very likely that people will make up plausible sounding answers on StackOverflow.

You and I have both taken time out of our days to write plausible sounding answers that are essentially opposing hallucinations.

balder1991 3mo ago

At least it used to be true.

JAlexoid 3mo ago

Have you ever heard of the Dunning-Kruger effect?

There's a reason why StackOverflow has upvotes, accepted solutions, and a third-party edit system: people will spend time writing their "hallucinations" very confidently.

lins1909 3mo ago

What is it about people making up lies to defend LLMs? In what world is it exactly the same as search? They're literally different things, since you get information from multiple sources and can do your own filtering.

actionfromafar 3mo ago

I wonder if the only way to fix this with current LLMs would be to generate a lot of synthetic data for a select number of topics you really don't want it to "go off the rails" with. That synthetic data would be lots of variations on "I don't know how to do X with Y".

dolmen 3mo ago

I would not bet on synthetic data.

LLMs are very good at detecting patterns.

RHSman2 3mo ago

The problem is not the intelligence of the LLM. It is the intelligence of those using them, and their desire to make things easy.

XCSme 3mo ago

But most benchmarks are not about that...

Are there even any "hallucination" public benchmarks?

andrepd 3mo ago

"Benchmarks" for LLMs are a total hoax, since you can train them on the benchmarks themselves.

XCSme 3mo ago

I would assume a good benchmark has hidden tests, or something randomly generated that is harder to game
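To make the "randomly generated" idea concrete, here is a toy sketch where the test items are produced from a seed that would be kept private, so the exact questions never appear in any published set. The multiplication task and the function names are assumptions picked for illustration, not how any existing benchmark works.

```python
import random

def make_item(rng: random.Random) -> tuple[str, int]:
    """Generate one fresh question with a known answer."""
    a, b = rng.randint(100, 999), rng.randint(100, 999)
    return f"What is {a} * {b}?", a * b

def make_benchmark(seed: int, n_items: int = 5) -> list[tuple[str, int]]:
    """The same seed reproduces the same items; a new seed gives a new test set."""
    rng = random.Random(seed)
    return [make_item(rng) for _ in range(n_items)]

def score(answers: list[int], benchmark: list[tuple[str, int]]) -> float:
    """Fraction of exact-match answers."""
    hits = sum(1 for given, (_, expected) in zip(answers, benchmark) if given == expected)
    return hits / len(benchmark)
```

Training on one generated set does not help with the next seed unless the model has actually learned the task, which is the property you want. The hard part in practice is finding tasks that can be generated and graded automatically like this.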

basisword 3mo ago

I think the thing even worse than false information is the almost-correct information. You do a quick Google to confirm it's on the right page but find there's an important misunderstanding. These are so much harder to spot I think than the blatantly false.
