An in-depth treatment of modern RAG and Deep Research, grounded in an AI-assisted systematic analysis of 2,000+ papers (2020–2026).
An interactive survey of ~2,000 papers, organized by topic and continuously updated before and after the tutorial. A single running example evolves across all sessions, and live polls let attendees predict benchmark results before the reveal.
More related surveys & courses. Browse our continuously growing collection of surveys, reading lists, and short courses on factuality, RAG, and agents — all curated and kept current at papers.lunadong.com.
Early inventors of RAG, active researchers, and industry practitioners — also the organizing team behind KDD Cup 2024 (CRAG) and 2025 (CRAG-MM).








Despite well-known hallucination issues, LLMs have become an increasingly indispensable source of information, and the underlying technology has advanced rapidly to make their answers far more reliable.
Early on, Retrieval-Augmented Generation (RAG) emerged as the dominant remedy, grounding LLM responses in external knowledge and evolving from simple retrieve-then-read pipelines into modular, graph-enhanced, and agentic systems. More recently, Agentic Deep Research has pushed the frontier further, equipping LLMs with autonomous planning, multi-hop investigation, and iterative synthesis to tackle open-ended questions that no single retrieval pass can answer.
This tutorial offers an in-depth treatment of modern RAG and Deep Research, grounded in an AI-assisted systematic analysis of 2,000+ recent papers (2020–2026). Attendees will leave with a structured roadmap, evidence-backed practical recommendations, and a clear map of open research opportunities.
A single question grows across the tutorial — from a simple lookup to an open-ended research task — each technique visibly improving on the last. We follow one running electric-vehicle example across the full range of question types:
In NLP, data mining, and knowledge management seeking a structured map of the RAG landscape and its open problems.
Building production LLM systems who need evidence-backed guidelines for choosing RAG architectures.
Entering the field, wanting a comprehensive entry point to 2K+ papers organized by topic and timeline.
Prerequisites. Familiarity with basic LLM concepts (pre-training, fine-tuning, prompting). No prior knowledge of RAG or hallucination detection is assumed.
★ denotes recommended pre-reading. The complete ~2,000-paper list lives in the online companion.