Retrieval-augmented generation (RAG) effectively addresses issues of static knowledge and hallucination in large language models. Existing studies mostly focus on question scenarios with clear user intents and concise answers. However, it is prevalent that users issue broad, open-ended queries with diverse sub-intents, for which they desire rich and long-form answers covering multiple relevant aspects. To tackle this important yet underexplored problem, we propose a novel RAG framework, namely RichRAG. It includes a sub-aspect explorer to identify potential sub-aspects of input questions, a multi-faceted retriever to build a candidate pool of diverse external documents related to these sub-aspects, and a generative list-wise ranker, which is a key module to provide the top-k most valuable documents for the final generator. These ranked documents sufficiently cover various query aspects and are aware of the generator's preferences, hence incentivizing it to produce rich and comprehensive responses for users. The training of our ranker involves a supervised fine-tuning stage to ensure the basic coverage of documents, and a reinforcement learning stage to align downstream LLM's preferences to the ranking of documents. Experimental results on two publicly available datasets prove that our framework effectively and efficiently provides comprehensive and satisfying responses to users.
翻译:检索增强生成(RAG)能有效解决大语言模型中静态知识与幻觉问题。现有研究多聚焦于用户意图明确且答案简洁的提问场景。然而,用户常提出宽泛、开放式的多子意图查询,期望获得覆盖多个相关方面的丰富长文本答案。为应对这一重要但尚未充分探索的问题,我们提出一种新颖的RAG框架——RichRAG。该框架包含子方面探索器(用于识别输入问题的潜在子方面)、多面检索器(用于构建与这些子方面相关的多样化外部文档候选池),以及生成式列表排序器(此为关键模块,负责为最终生成器提供最具价值的top-k文档)。经排序的文档充分覆盖查询的各个层面,且能感知生成器的偏好,从而激励其为用户生成丰富而全面的响应。排序器的训练包含监督微调阶段(确保文档的基础覆盖度)和强化学习阶段(使下游大语言模型的偏好与文档排序对齐)。在两个公开数据集上的实验结果证明,本框架能有效且高效地为用户提供全面而令人满意的响应。