The setup
The model is smart. It's just not informed.
Two different things get jammed together in most people's heads: how capable a model is, and how much it knows about you. A modern LLM is enormously capable — it can write, summarize, reason through a problem, draft a quote. But its knowledge is frozen and general. It learned from text that existed before its training cutoff, and your business was almost certainly not in it. Your prices, your warranty terms, last month's job notes, the quirks of your three biggest customers — none of that was ever on the public internet, and the parts that were are now out of date.
So when you ask a raw model a question about your operation, it can't look anything up. It generates the most plausible-sounding answer from its general training — and "plausible" and "true for your business" are not the same thing. That's the gap RAG closes. It doesn't make the model smarter. It makes the model informed, by bolting a real information source onto it.
A carpenter analogy: a master framer dropped on a job site he's never seen is still a master framer — but he can't tell you where this building's load-bearing walls are until someone hands him the prints. RAG is handing the model the prints.
"A raw LLM is a sharp consultant with amnesia about your company. RAG is the binder you slide across the desk before you ask the question."