Abstract: Retrieval-augmented generation (RAG) improves factuality by grounding large language models (LLMs) in external corpora, but it still struggles with multi-hop reasoning and long-context ...