The Retrieval-Augmented Generation (RAG) framework combines parametric knowledge with external knowledge to achieve state-of-the-art performance on open-domain question answering tasks. However, RAG suffers performance degradation when the query is accompanied by irrelevant contexts. In this work, we propose the RE-RAG framework, which introduces a relevance estimator (RE) that not only provides relative relevance between contexts, as previous rerankers do, but also provides a confidence score that can be used to classify whether a given context is useful for answering a given question. We propose a weakly supervised method for training the RE that uses only question-answer pairs, without any labels identifying correct contexts. We show that an RE trained with a small generator (sLM) can improve not only the sLM fine-tuned together with it, but also large language models (LLMs) that the RE never saw during training. Furthermore, we investigate new decoding strategies that exploit the confidence measured by the RE, such as informing the user that the question is "unanswerable" given the retrieved contexts, or falling back on the LLM's parametric knowledge rather than relying on unrelated contexts.
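The confidence-gated decoding described above can be sketched as follows. This is a minimal illustrative example, not the authors' implementation: the sigmoid mapping, the threshold `tau`, and the `generate` stub are all assumptions standing in for the actual RE and generator models.

```python
import math


def re_confidence(raw_score: float) -> float:
    # Hypothetical mapping of a raw relevance-estimator score to a
    # [0, 1] confidence via a sigmoid (assumption, not the paper's exact form).
    return 1.0 / (1.0 + math.exp(-raw_score))


def generate(question, contexts):
    # Stub standing in for the sLM/LLM generator.
    if contexts is None:
        return f"[parametric answer to: {question}]"
    return f"[answer to: {question} using {len(contexts)} context(s)]"


def decode(question, contexts, scores, tau=0.5, fallback="parametric"):
    """Confidence-gated decoding sketch.

    Keep only contexts whose RE confidence reaches the threshold `tau`.
    If none qualify, either abstain ("unanswerable") or fall back on the
    generator's parametric knowledge, per the strategies in the abstract.
    """
    confident = [c for c, s in zip(contexts, scores)
                 if re_confidence(s) >= tau]
    if not confident:
        if fallback == "abstain":
            return "unanswerable"
        return generate(question, None)  # rely on parametric knowledge
    return generate(question, confident)
```

For example, a context scored at -5.0 (confidence ≈ 0.007) is discarded, triggering the fallback, while a context scored at 3.0 (confidence ≈ 0.95) is passed to the generator.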