Polytuplet Loss: A Reverse Approach to Training Reading Comprehension and Logical Reasoning Models - 专知论文

会员服务 ·

0

逻辑推理 · 机器阅读理解 · 损失 · 损失函数 · 推理模型 ·

2023 年 4 月 3 日

Polytuplet Loss: A Reverse Approach to Training Reading Comprehension and Logical Reasoning Models

翻译：多组损失函数：一种训练阅读理解与逻辑推理模型的反向方法

Jeffrey Lu,Ivan Rodriguez

Throughout schooling, students are tested on reading comprehension and logical reasoning. Students have developed various strategies for completing such exams, some of which are generally thought to outperform others. One such strategy involves emphasizing relative accuracy over absolute accuracy and can theoretically produce the correct answer without full knowledge of the information required to solve the question. This paper examines the effectiveness of applying such a strategy to train transfer learning models to solve reading comprehension and logical reasoning questions. The models were evaluated on the ReClor dataset, a challenging reading comprehension and logical reasoning benchmark. While previous studies targeted logical reasoning skills, we focus on a general training method and model architecture. We propose the polytuplet loss function, an extension of the triplet loss function, to ensure prioritization of learning the relative correctness of answer choices over learning the true accuracy of each choice. Our results indicate that models employing polytuplet loss outperform existing baseline models. Although polytuplet loss is a promising alternative to other contrastive loss functions, further research is required to quantify the benefits it may present.

翻译：在求学过程中，学生需通过考试检验阅读理解与逻辑推理能力。学生发展出多种应试策略，其中部分策略普遍被认为优于其他策略。一种典型策略强调相对准确性优于绝对准确性，理论上可在不完全掌握解题所需信息的情况下得出正确答案。本文旨在探究将该策略应用于训练迁移学习模型以解决阅读理解与逻辑推理问题的有效性。模型在具有挑战性的阅读理解与逻辑推理基准数据集ReClor上进行了评估。现有研究主要聚焦于逻辑推理技能，而本文则关注通用训练方法与模型架构。我们提出多组损失函数（polytuplet loss function），该函数作为三元组损失函数的扩展，旨在优先学习答案选项的相对正确性而非每个选项的真实准确性。实验结果表明，采用多组损失函数的模型性能优于现有基线模型。尽管多组损失函数可作为对比损失函数的有效替代方案，但量化其潜在优势仍需进一步研究。

0

相关内容

逻辑推理

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

如何进行有效知识推理？斯坦福Jure《联合知识图谱与语言模型的推理》报告，附81页ppt

如何进行有效知识推理？斯坦福Jure《联合知识图谱与语言模型的推理》报告，附81页ppt

专知会员服务

105+阅读 · 2021年6月13日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

专知会员服务

11+阅读 · 2020年5月25日

【预训练论文】预训练Transformer校准，Calibration of Pre-trained Transformers

【预训练论文】预训练Transformer校准，Calibration of Pre-trained Transformers

专知会员服务

26+阅读 · 2020年3月19日

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

专知会员服务

104+阅读 · 2019年12月30日

【AAAI2020论文】关注实体以更好地理解文本（Attending to Entities for Better Text Understanding）

【AAAI2020论文】关注实体以更好地理解文本（Attending to Entities for Better Text Understanding）

专知会员服务

25+阅读 · 2019年11月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

论文浅尝 | Neural-Symbolic Models for Logical Queries on KG

论文浅尝 | Neural-Symbolic Models for Logical Queries on KG

开放知识图谱

0+阅读 · 2022年10月31日

IJCAI 2022 | 使用陈述句进行视觉问答的Prompt Tuning

IJCAI 2022 | 使用陈述句进行视觉问答的Prompt Tuning

PaperWeekly

3+阅读 · 2022年9月21日

论文浅尝 | XQA：一个跨语言开放域问答数据集

论文浅尝 | XQA：一个跨语言开放域问答数据集

开放知识图谱

26+阅读 · 2019年9月11日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

专知

15+阅读 · 2018年6月29日

【论文推荐】最新六篇自动问答相关论文—无监督迁移学习、综述、生成式问答、QDEE、可扩展文档理解

【论文推荐】最新六篇自动问答相关论文—无监督迁移学习、综述、生成式问答、QDEE、可扩展文档理解

专知

12+阅读 · 2018年5月9日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

【论文推荐】最新六篇视觉问答（VQA）相关论文—盲人问题、物体计数、多模态解释、视觉关系、对抗性网络、对偶循环注意力

【论文推荐】最新六篇视觉问答（VQA）相关论文—盲人问题、物体计数、多模态解释、视觉关系、对抗性网络、对偶循环注意力

专知

32+阅读 · 2018年2月28日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

中性粒细胞TRPM2通道在脓毒症细菌清除中的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

抗MRSA活性rhodomyrtosone B类似物的合成和构效关系研究

国家自然科学基金

0+阅读 · 2015年12月31日

类黄酮与磷脂混合胶束的形成及其模型研究

国家自然科学基金

0+阅读 · 2013年12月31日

Dnmt1调控斑马鱼造血干细胞产生、分化及迁移的分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

信号通路XBP1-p21在细胞周期调控中的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ca2+调控骨髓基质干细胞BMP-2、Ang-1成骨及血管化的作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

光信号在植物microRNA转录和加工过程中的调控分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

超低折射率二氧化硅光学薄膜的微观结构与光学性能研究

国家自然科学基金

1+阅读 · 2013年12月31日

Ca2+信号通路介导猪骨髓MSCs成脂分化的分子机制及其营养调控

国家自然科学基金

0+阅读 · 2012年12月31日

miR-210 介导的牙周膜干细胞修复标准骨缺损及ERK/P38信号转导通路的调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

Learning Answer Generation using Supervision from Automatic Question Answering Evaluators

Learning Answer Generation using Supervision from Automatic Question Answering Evaluators

Arxiv

0+阅读 · 2023年5月24日

Rethinking Existential First Order Queries and their Inference on Knowledge Graphs

Arxiv

1+阅读 · 2023年5月24日

Machine Reading Comprehension using Case-based Reasoning

Arxiv

0+阅读 · 2023年5月24日

Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study

Arxiv

0+阅读 · 2023年5月22日

Teaching Probabilistic Logical Reasoning to Transformers

Arxiv

0+阅读 · 2023年5月22日

Stability, Generalization and Privacy: Precise Analysis for Random and NTK Features

Arxiv

0+阅读 · 2023年5月20日

Transformers in Medical Imaging: A Survey

Arxiv

15+阅读 · 2022年1月24日

Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks

Arxiv

14+阅读 · 2021年1月31日

Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension

Arxiv

12+阅读 · 2020年12月14日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

12+阅读 · 2020年6月23日

VIP会员

文章信息

相关主题

机器阅读理解

最新内容

综述 | 从问答到任务完成：Agent系统与Harness设计

综述 | 从问答到任务完成：Agent系统与Harness设计

专知会员服务

1+阅读 · 今天16:54

Agentic RL：框架、实践与长程智能体训练

Agentic RL：框架、实践与长程智能体训练

专知会员服务

1+阅读 · 今天16:52

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

专知会员服务

6+阅读 · 今天8:00

重新思考无人机时代的生存能力

重新思考无人机时代的生存能力

专知会员服务

5+阅读 · 今天7:44

装甲突击旅：现代战争思考、战斗与组织

装甲突击旅：现代战争思考、战斗与组织

专知会员服务

4+阅读 · 今天7:28

在人工智能加速决策环境中拓展OODA循环

在人工智能加速决策环境中拓展OODA循环

专知会员服务

4+阅读 · 今天7:18

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

专知会员服务

5+阅读 · 今天7:07

军事欺骗：供作战战术指挥官使用的工具

军事欺骗：供作战战术指挥官使用的工具

专知会员服务

4+阅读 · 今天7:03

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

专知会员服务

4+阅读 · 6月23日

综述 | 世界动作模型：少做梦，多行动

综述 | 世界动作模型：少做梦，多行动

专知会员服务

6+阅读 · 6月23日

美以伊冲突：无人机与人工智能的运用

美以伊冲突：无人机与人工智能的运用

专知会员服务

10+阅读 · 6月23日

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

专知会员服务

4+阅读 · 6月23日

《特种部队在透明战场中的生存力》最新报告

《特种部队在透明战场中的生存力》最新报告

专知会员服务

5+阅读 · 6月23日

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

专知会员服务

8+阅读 · 6月23日

《人工智能生成的零日漏洞：对未来作战的影响》

《人工智能生成的零日漏洞：对未来作战的影响》

专知会员服务

7+阅读 · 6月23日

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

如何进行有效知识推理？斯坦福Jure《联合知识图谱与语言模型的推理》报告，附81页ppt

如何进行有效知识推理？斯坦福Jure《联合知识图谱与语言模型的推理》报告，附81页ppt

专知会员服务

105+阅读 · 2021年6月13日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

专知会员服务

11+阅读 · 2020年5月25日

【预训练论文】预训练Transformer校准，Calibration of Pre-trained Transformers

【预训练论文】预训练Transformer校准，Calibration of Pre-trained Transformers

专知会员服务

26+阅读 · 2020年3月19日

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

专知会员服务

104+阅读 · 2019年12月30日

【AAAI2020论文】关注实体以更好地理解文本（Attending to Entities for Better Text Understanding）

【AAAI2020论文】关注实体以更好地理解文本（Attending to Entities for Better Text Understanding）

专知会员服务

25+阅读 · 2019年11月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

Agentic RL：框架、实践与长程智能体训练

重新思考无人机时代的生存能力

综述 | 从问答到任务完成：Agent系统与Harness设计

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

相关资讯

论文浅尝 | Neural-Symbolic Models for Logical Queries on KG

论文浅尝 | Neural-Symbolic Models for Logical Queries on KG

开放知识图谱

0+阅读 · 2022年10月31日

IJCAI 2022 | 使用陈述句进行视觉问答的Prompt Tuning

IJCAI 2022 | 使用陈述句进行视觉问答的Prompt Tuning

PaperWeekly

3+阅读 · 2022年9月21日

论文浅尝 | XQA：一个跨语言开放域问答数据集

论文浅尝 | XQA：一个跨语言开放域问答数据集

开放知识图谱

26+阅读 · 2019年9月11日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

专知

15+阅读 · 2018年6月29日

【论文推荐】最新六篇自动问答相关论文—无监督迁移学习、综述、生成式问答、QDEE、可扩展文档理解

【论文推荐】最新六篇自动问答相关论文—无监督迁移学习、综述、生成式问答、QDEE、可扩展文档理解

专知

12+阅读 · 2018年5月9日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

【论文推荐】最新六篇视觉问答（VQA）相关论文—盲人问题、物体计数、多模态解释、视觉关系、对抗性网络、对偶循环注意力

【论文推荐】最新六篇视觉问答（VQA）相关论文—盲人问题、物体计数、多模态解释、视觉关系、对抗性网络、对偶循环注意力

专知

32+阅读 · 2018年2月28日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

相关论文

Learning Answer Generation using Supervision from Automatic Question Answering Evaluators

Learning Answer Generation using Supervision from Automatic Question Answering Evaluators

Arxiv

0+阅读 · 2023年5月24日

Rethinking Existential First Order Queries and their Inference on Knowledge Graphs

Arxiv

1+阅读 · 2023年5月24日

Machine Reading Comprehension using Case-based Reasoning

Arxiv

0+阅读 · 2023年5月24日

Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study

Arxiv

0+阅读 · 2023年5月22日

Teaching Probabilistic Logical Reasoning to Transformers

Arxiv

0+阅读 · 2023年5月22日

Stability, Generalization and Privacy: Precise Analysis for Random and NTK Features

Arxiv

0+阅读 · 2023年5月20日

Transformers in Medical Imaging: A Survey

Arxiv

15+阅读 · 2022年1月24日

Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks

Arxiv

14+阅读 · 2021年1月31日

Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension

Arxiv

12+阅读 · 2020年12月14日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

12+阅读 · 2020年6月23日

相关基金

中性粒细胞TRPM2通道在脓毒症细菌清除中的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

抗MRSA活性rhodomyrtosone B类似物的合成和构效关系研究

国家自然科学基金

0+阅读 · 2015年12月31日

类黄酮与磷脂混合胶束的形成及其模型研究

国家自然科学基金

0+阅读 · 2013年12月31日

Dnmt1调控斑马鱼造血干细胞产生、分化及迁移的分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

信号通路XBP1-p21在细胞周期调控中的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ca2+调控骨髓基质干细胞BMP-2、Ang-1成骨及血管化的作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

光信号在植物microRNA转录和加工过程中的调控分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

超低折射率二氧化硅光学薄膜的微观结构与光学性能研究

国家自然科学基金

1+阅读 · 2013年12月31日

Ca2+信号通路介导猪骨髓MSCs成脂分化的分子机制及其营养调控

国家自然科学基金

0+阅读 · 2012年12月31日

miR-210 介导的牙周膜干细胞修复标准骨缺损及ERK/P38信号转导通路的调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员