A 3.5-hour, evidence-backed tour of modern RAG — grounded in an AI-assisted analysis of 2,000+ papers (2020–2026).
Organized by topic and continuously updated — explore methods, timelines, and benchmarks during and after the session. This tutorial is itself an instance of AI-facilitated Deep Research.
More related surveys & courses. Browse our growing collection of surveys, reading lists, and short courses on factuality, RAG, and agents — curated and kept current.
Explore more on papers.lunadong.com →A single running example evolves across all sessions, with live polls at key decision points. Each section links to its own slide deck.
Early inventors of RAG, active researchers, and industry practitioners — organizers of KDD Cup 2024 (CRAG) & 2025 (CRAG-MM).
Xin Luna DongPrincipal Scientist, Meta Wearables AI · ACM & IEEE Fellow
Sanat SharmaResearch Engineer, Meta Reality Labs
Scott Wen-tau YihResearch Scientist, Meta FAIR · Affiliate Professor, UW · ACL Fellow
Yinglong XiaApplied Research Scientist, Meta Recommendation Systems
Xiao YangResearch Scientist, Meta Reality Labs
Kai SunResearch Scientist, Meta Reality Labs
Jiaqi WangML Engineer, Meta Reality Labs
Franklin ZhangUW CSE · Online AI CompanionDespite well-known hallucination issues, LLMs have become an increasingly indispensable source of information, and the underlying technology has advanced rapidly to make their answers far more reliable. Early on, Retrieval-Augmented Generation (RAG) emerged as the dominant remedy, grounding LLM responses in external knowledge and evolving from simple retrieve-then-read pipelines into modular, graph-enhanced, and agentic systems. More recently, Agentic Deep Research has pushed the frontier further, equipping LLMs with autonomous planning, multi-hop investigation, and iterative synthesis to tackle open-ended questions that no single retrieval pass can answer.
This tutorial offers an in-depth treatment of modern RAG and Deep Research, grounded in an AI-assisted systematic analysis of 2,000+ recent papers (2020–2026). Attendees will leave with a structured roadmap, evidence-backed practical recommendations, and a clear map of open research opportunities.
Five independently optimizable stages — triggering, query rewriting, retrieval, post-processing, and answer generation.
Knowledge graphs and hybrid retrieval, GNN–LLM integration, and multi-hop reasoning for complex QA.
Learned retrieval policies and long-horizon agents that plan, browse, reason, and synthesize cited reports.
A single question grows across the tutorial — from a simple lookup to an open-ended research task — each technique visibly improving on the last. We follow one running electric-vehicle example across the full range of question types:
What is the battery capacity of a Tesla Model 3?Expected answer: a single number (e.g., "~60 kWh for the Standard Range, ~82 kWh for the Long Range").
Which has a longer driving range — the Tesla Model 3 or the Lucid Air?
I'm considering buying an electric vehicle in 2026. Which model is the best choice, and why?Expected answer: a short structured report identifying 2–3 top candidates, comparing them across key dimensions, and offering a recommendation tailored to the user — along with the main trade-offs.
[Photo of an EV in a parking lot] How far can it go on a full charge?Expected answer: "That looks like a Tesla Model 3 Long Range — it has an EPA-rated range of about 358 miles on a full charge."
★ marks recommended pre-reading. The full ~2,000-paper list lives in the online companion.