Question 1

Why do language models hallucinate at all?

Accepted Answer

They optimize for probable text, not true text. Training rewards predicting the next token correctly, which often coincides with truth but is not the same goal. When the model encounters a question outside its reliable knowledge, the most probable continuation is a confident-sounding answer, because confident answers are what the training data contains.

Question 2

Does RAG eliminate hallucinations?

Accepted Answer

No, but it reduces them substantially. The model can still misinterpret retrieved context, ignore it in favor of training-data recall, or generate over-confidently when retrieval returns weak matches. Production RAG systems add retrieval quality scoring, citation requirements, and evaluation infrastructure on top to catch the failures that grounding alone misses.

Question 3

How do I detect hallucinations in production?

Accepted Answer

Three layers help. Citation tracing checks whether claimed facts appear in retrieved sources. LLM-as-judge runs a second model to score the answer for grounding. Human review samples a fraction of outputs and feeds corrections into the eval set. Funded teams budget for at least one of these layers from day one.

Question 4

Which models hallucinate less?

Accepted Answer

Larger models hallucinate less than smaller ones on average, and instruction-tuned models hallucinate less than base models. Claude 3.5 Sonnet and GPT-4o post the lowest hallucination rates on most benchmarks, with smaller open-source models like Llama 3.1 70B close behind. Model choice alone does not solve the problem, but it sets the floor.