Retrieval-Augmented Generation (RAG) systems represent a significant advancement over traditional Large Language Models (LLMs). RAG systems enhance their generation ability by incorporating external data retrieved through an Information Retrieval (IR) phase, overcoming the limitations of standard LLMs, which are restricted to their pre-trained knowledge and limited context window. Most research in this area has predominantly concentrated on the generative aspect of LLMs within RAG systems. Our study fills this gap by thoroughly and critically analyzing the influence of IR components on RAG systems. This paper analyzes which characteristics a retriever should possess for an effective RAG's prompt formulation, focusing on the type of documents that should be retrieved. We evaluate various elements, such as the relevance of the documents to the prompt, their position, and the number included in the context. Our findings reveal, among other insights, that including irrelevant documents can unexpectedly enhance performance by more than 30% in accuracy, contradicting our initial assumption of diminished quality. These results underscore the need for developing specialized strategies to integrate retrieval with language generation models, thereby laying the groundwork for future research in this field.
翻译:检索增强生成(RAG)系统相较于传统大语言模型(LLMs)取得了显著进步。RAG系统通过引入信息检索(IR)阶段获取外部数据来增强生成能力,克服了标准LLMs仅依赖预训练知识和有限上下文窗口的局限性。现有研究大多聚焦于RAG系统中LLMs的生成层面。本研究通过深入批判性分析IR组件对RAG系统的影响,填补了这一研究空白。本文系统分析了检索器应具备何种特征以实现有效的RAG提示构建,重点关注需检索的文档类型。我们评估了文档与提示的相关性、文档位置及上下文包含的文档数量等多个要素。研究发现表明:引入不相关文档竟能意外地将准确率提升超过30%,这与我们最初认为会降低性能的假设相悖。这些结果凸显了开发检索与语言生成模型集成专用策略的必要性,为该领域的未来研究奠定了基础。