undefined | Better HN

0 pointsfamouswaffles2y ago0 comments

automatically comparing drafts for every single query would be expensive.

and that wouldn't eliminate hallucinations just tell you if large details have likely been hallucinated.

But it's a method some research has used.

0 comments

How expensive could it be? Google Bard, a free service, offers the drafts for free. Just do the comparison on the user’s machine if the LLM provider is that cheap.

P.S. Also aren’t LLMs deterministic if you set their “temperature” to zero? Are there drafts if the temperature is zero? If not, then that’s the same as removing the randomness no?

famouswafflesOP2y ago

The drafts have to be evaluated either by a human or llm. Doing that for every request does not scale when you have millions of users.

>Just do the comparison on the user’s machine if the LLM provider is that cheap.

This is not possible. Users don't have the resources to run these gigantic models. LLM inference is not cheap. Open ai, Google aren't running profit on free cGPT or Bard.

>P.S. Also aren’t LLMs deterministic if you set their “temperature” to zero? Are there drafts if the temperature is zero? If not, then that’s the same as removing the randomness no?

It's not a problem of randomness. a temp of 0 doesn't reduce hallucinations. LLMs internally know when they are hallucinating/taking a wild guess. randomness influences how that guess manifests each time but the decision to guess was already made.

https://arxiv.org/abs/2304.13734

TerrifiedMouse2y ago

> a temp of 0 doesn't reduce hallucinations.

I never said it did.

> LLMs internally know when they are hallucinating/taking a wild guess.

No they don’t. If they did we would be able to program them to not do so.

I would argue that wild guesses are all LLMs are doing. They practically statistically guess their way to an answer. It works surprisingly well a lot of the time but they don’t really understand why they are right/wrong.

P.S. LLMs are kind of like students who didn’t study for the test so they use “heuristics” to guess the answer. If the test setter is predictable enough, the student might actually get a few right.

j / k navigate · click thread line to collapse

0 comments

TerrifiedMouse2y ago

How expensive could it be? Google Bard, a free service, offers the drafts for free. Just do the comparison on the user’s machine if the LLM provider is that cheap.

P.S. Also aren’t LLMs deterministic if you set their “temperature” to zero? Are there drafts if the temperature is zero? If not, then that’s the same as removing the randomness no?

famouswafflesOP2y ago

The drafts have to be evaluated either by a human or llm. Doing that for every request does not scale when you have millions of users.

>Just do the comparison on the user’s machine if the LLM provider is that cheap.

This is not possible. Users don't have the resources to run these gigantic models. LLM inference is not cheap. Open ai, Google aren't running profit on free cGPT or Bard.

>P.S. Also aren’t LLMs deterministic if you set their “temperature” to zero? Are there drafts if the temperature is zero? If not, then that’s the same as removing the randomness no?

https://arxiv.org/abs/2304.13734

TerrifiedMouse2y ago

> a temp of 0 doesn't reduce hallucinations.

I never said it did.

> LLMs internally know when they are hallucinating/taking a wild guess.

No they don’t. If they did we would be able to program them to not do so.

j / k navigate · click thread line to collapse