Abstractive multi-document summarization (MDS) is the task of automatically summarizing information in multiple documents, from news articles to conversations with multiple speakers. Current MDS models can be grouped into four training approaches: end-to-end with special pre-training ("direct"), chunk-then-summarize, extract-then-summarize, and inference with GPT-style models. In this work, we evaluate MDS models across training approaches, domains, and dimensions (reference similarity, quality, and factuality) to analyze how and why models trained on one domain fail to summarize documents from another (News, Science, and Conversation) in the zero-shot domain-transfer setting. We define domain-transfer "failure" as a decrease in factuality, greater deviation from the target summary, and a general decrease in summary quality. In addition to exploring domain transfer for MDS models, we examine potential issues with applying popular summarization metrics out-of-the-box.