Evaluating the Tradeoff Between Abstractiveness and Factuality in Abstractive Summarization

Neural models for abstractive summarization tend to generate output that is fluent and well-formed but lacks semantic faithfulness, or factuality, with respect to the input documents. In this paper, we analyze the tradeoff between abstractiveness and factuality of generated summaries across multiple datasets and models, using extensive human evaluations of factuality. In our analysis, we visualize the rates of change in factuality as we gradually increase abstractiveness using a decoding constraint, and we observe that, while increased abstractiveness generally leads to a drop in factuality, the rate of factuality decay depends on factors such as the data that the system was trained on. We introduce two datasets with human factuality judgements: one containing 10.2k generated summaries with systematically varied degrees of abstractiveness, the other containing 4.2k summaries from five different summarization models. We propose new factuality metrics that adjust for the degree of abstractiveness, and we use them to compare the abstractiveness-adjusted factuality of previous summarization works, providing baselines for future work.

