At least to me, the difference is that one is ostensibly an explanation of how t...

gpm · 2025-01-21T00:55:14 1737420914

You can be pretty sure the AI isn't doing a post-hoc explanation because the only writable memory it has access to is the tokens it has already output - i.e. the explanation of the joke. Everything else is reset between every token.

As long as it comes up with different jokes different times you ask it (assuming randomness in sampling) - how could it.

baq · 2025-01-21T06:26:09 1737440769

The problem is it can’t remember what it hasn’t written but the end result still makes sense, so there has to be some goal after parsing the initial context that the tokens are emitted towards to. This means there’s nothing stopping it from producing an explanation, it might be in there from the very start.

gpm · 2025-01-21T19:25:50 1737487550

Not one goal though, but a set of goals. Otherwise the random sampling of tokens wouldn't result in it outputting meaningfully different jokes.

I also think it's safe to assume that the set of goals isn't fully resolved, but it's harder to "prove" that.

svachalek · 2025-01-21T15:20:59 1737472859

There's no goal. The tokens are computed one at a time from all the previous tokens.

baq · 2025-01-21T16:33:21 1737477201

One is orthogonal to the other.