The answer
What an LLM actually is.
An LLM is a prediction machine. You give it some text. It predicts what chunk of text most likely comes next, writes that down, then takes everything so far and predicts the next chunk. And the next. Until it stops. That's it. That's the whole thing.
It is not thinking. It is not reasoning. It is not "looking things up." It generates the next most-probable piece of text given everything before it. The fact that this produces something that sounds like thinking is a measure of how good the predictions are — not evidence that thinking is happening underneath.
The most honest shorthand is the one engineers use: auto-complete on steroids. Your phone guesses "soon" after you type "I'll be home" because it has seen the pattern. An LLM is the same idea, blown up by about six orders of magnitude — trained not on your texts but on a huge chunk of the public internet, books, and code; guessing the next chunk not from your last three words but from the last several thousand. Better data, more context, more compute. Same core operation: predict the next chunk.
One thing it did not do: learn facts the way you learn them. It never read a textbook, verified a source, or updated a belief when the world changed. It saw an enormous amount of text and got frighteningly good at predicting text that looks like it. That single fact is the seed of everything weird about how it behaves.