RAG in production is less about fancy retrieval tricks and more about guardrails: evals, observability, and deterministic fallbacks when retrieval is weak.
- Track retrieval quality (top-k hit rate, MRR) per index and per query type.
- Use domain schemas and chunking strategies that mirror how subject-matter experts (SMEs) think.
- Evaluate end-to-end with golden sets and regression tests.
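
As a rough illustration of the tracking and regression bullets above, here is a minimal sketch that computes top-k hit rate and MRR (truncated at k) per index and query type from a golden set, then flags groups that fell below a stored baseline. Everything named here (`EvalRecord`, `retrieval_metrics`, `find_regressions`, the `tolerance` default, and the sample index and query-type labels) is hypothetical and not taken from any particular library.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class EvalRecord:
    """One golden-set query and what the retriever returned (hypothetical shape)."""
    index: str              # which index served the query
    query_type: str         # e.g. "lookup", "multi_hop"
    ranked_ids: list[str]   # retrieved document IDs, best first
    relevant_ids: set[str]  # document IDs the golden set marks as relevant


def first_hit_rank(ranked_ids, relevant_ids, k):
    """Return the 1-based rank of the first relevant doc in the top k, or None."""
    for rank, doc_id in enumerate(ranked_ids[:k], start=1):
        if doc_id in relevant_ids:
            return rank
    return None


def retrieval_metrics(records, k=5):
    """Compute hit rate@k and MRR@k grouped by (index, query_type)."""
    ranks_by_group = defaultdict(list)
    for rec in records:
        rank = first_hit_rank(rec.ranked_ids, rec.relevant_ids, k)
        ranks_by_group[(rec.index, rec.query_type)].append(rank)

    metrics = {}
    for group, ranks in ranks_by_group.items():
        n = len(ranks)
        metrics[group] = {
            "hit_rate": sum(r is not None for r in ranks) / n,
            # Queries with no hit in the top k contribute 0 to MRR.
            "mrr": sum(1.0 / r for r in ranks if r is not None) / n,
            "n": n,
        }
    return metrics


def find_regressions(current, baseline, tolerance=0.02):
    """List (group, metric) pairs that dropped more than `tolerance` vs. baseline."""
    failures = []
    for group, base in baseline.items():
        now = current.get(group)
        for name in ("hit_rate", "mrr"):
            if now is None or now[name] < base[name] - tolerance:
                failures.append((group, name))
    return failures


records = [
    EvalRecord("faq", "lookup", ["a", "b", "c"], {"b"}),
    EvalRecord("faq", "lookup", ["x", "y", "z"], {"x"}),
    EvalRecord("faq", "lookup", ["p", "q"], {"zz"}),
    EvalRecord("contracts", "multi_hop", ["d1", "d2", "d3", "d4"], {"d4"}),
]
metrics = retrieval_metrics(records, k=3)
baseline = {("faq", "lookup"): {"hit_rate": 0.9, "mrr": 0.5}}
failures = find_regressions(metrics, baseline)
```

In a CI job, a non-empty `failures` list could fail the build, which turns the golden set into a regression test. Slicing by index and query type matters because an aggregate score can hide a weak slice (here, the hypothetical `contracts` multi-hop queries miss entirely at k=3).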