The LLM has an internal "confidence score" but that has NOTHING to do with how c...

sharperguy · 2026-03-01T10:20:39 1772360439

Still, it might be interesting information to have access to, as someone running the model? Normally we are reading the output trying to build an intuition for the kinds of patterns it outputs when it's hallucinating vs creating something that happens to align with reality. Adding in this could just help with that even when it isn't always correlated to reality itself.

alexwebb2 · 2026-03-01T10:06:50 1772359610

Huge leap there in your conclusion. Looks like you’re hand-waving away the entire phenomenon of emergent properties.

amelius · 2026-03-01T11:36:30 1772364990

> In short: LLM have no concept, or even desire to produce of truth

They do produce true statements most of the time, though.

jaen · 2026-03-01T11:51:30 1772365890

That's just because true statements are more likely to occur in their training corpus.

amelius · 2026-03-01T12:42:45 1772368965

The training set is far too small for that to explain it.

Try to explain why one shotting works.

jaen · 2026-03-01T14:58:18 1772377098

Uh, to explain what? You probably read something into what I said while I was being very literal.

If you train an LLM on mostly false statements, it will generate both known and novel falsehoods. Same for truth.

An LLM has no intrinsic concept of true or false, everything is a function of the training set. It just generates statements similar to what it has seen and higher-dimensional analogies of those .

red75prime · 2026-03-02T04:38:31 1772426311

Reasoning allows to produce statements that are more likely to be true based on statements that are known to be true. You'd need to structure your "falsehood training data" in a specific way to allow an LLM to generalize as well as with the regular data (instead of memorizing noise). And then you'll get a reasoning model which remembers false premises.

You generate your text based on a "stochastic parrot" hypothesis with no post-validation it seems.

jaen · 2026-03-02T09:29:27 1772443767

Really, how hard is it to follow HN guidelines and :

a) not imagine straw-man arguments and not imagine more (or less) than what was said

b) refrain from snarky and false ad hominems

None of what you said in no way conflicts with what I said, and again shows a fundamental misunderstanding.

Reasoning is (mostly) part of the post-training dataset. If you add a large majority of false (ie. paradoxical, irrational etc.) reasoning traces to those, you'll get a model that successfully replicates the false reasoning of humans. If you mix it in with true reasoning traces, I imagine you'll get infinite loop behaviour as the reasoning trace oscillates between the true and the false.

The original premise that truth is purely a function of the training dataset still stands... I'm not even sure what people are arguing here, as that seems quite trivially obvious?

red75prime · 2026-03-04T06:43:26 1772606606

Ah, sorry. I haven't recognized "all the high-level capabilities of an LLM come from the training data (presumably unlike humans, given the context of this thread)" in your wording. This is probably true. LLM structure probably has no inherent inductive bias that would amount to truth seeking. If you want to get a useless LLM, you can do it. OK, no disagreement here.

red75prime · 2026-03-01T21:16:27 1772399787

The overwhelming majority of true statements isn't in the training corpus due to a combinatorial explosion. What it means that they are more likely to occur there?