Retrieval-Augmented Generation (RAG) has become a cornerstone of knowledge-intensive applications, including enterprise chatbots, healthcare assistants, and agentic memory management. However, recent studies show that knowledge-extraction attacks can recover sensitive knowledge-base content through maliciously crafted queries, raising serious intellectual property and privacy concerns. While prior work has explored individual attack and defense techniques, the research landscape remains fragmented, spanning heterogeneous retrieval embeddings, diverse generation models, and evaluations based on non-standardized metrics and inconsistent datasets. To address this gap, we introduce the first systematic benchmark for knowledge-extraction attacks on RAG systems. Our benchmark covers broad attack/defense strategies, representative retrieval embedding models, open/closed-source generators, (non) graph-based indexing, all evaluated under a unified experimental framework with standardized protocols across multiple datasets spanning diverse languages. By consolidating the experimental landscape and enabling reproducible, comparable evaluation, this benchmark provides actionable insights and a practical foundation for developing privacy-preserving RAG systems in the face of emerging knowledge extraction threats.
翻译:检索增强生成(RAG)已成为知识密集型应用的基石,包括企业级聊天机器人、医疗辅助系统及智能体记忆管理。然而近期研究表明,知识提取攻击可通过恶意构造的查询恢复敏感知识库内容,引发严重的知识产权与隐私问题。尽管已有研究探索了个体攻击与防御技术,但该领域研究仍呈碎片化态势——涉及异构检索嵌入、多样化生成模型,以及基于非标准化指标和不一致数据集的评估。为弥补这一空白,我们首次提出针对RAG系统知识提取攻击的系统性基准测试。该基准涵盖广泛攻击/防御策略、代表性检索嵌入模型、开源/闭源生成器及基于(非)图的索引方案,所有方案均在统一实验框架下评估,采用跨多语言数据集的标准化协议。通过整合实验生态并实现可复制、可对比的评估,本基准为应对新兴知识提取威胁下的隐私保护型RAG系统开发提供了可操作见解与实践基础。