A Lightweight Constrained Generation Alternative for Query-focused Summarization

Query-focused summarization (QFS) aims to provide a summary of a document that satisfies information need of a given query and is useful in various IR applications, such as abstractive snippet generation. Current QFS approaches typically involve injecting additional information, e.g. query-answer relevance or fine-grained token-level interaction between a query and document, into a finetuned large language model. However, these approaches often require extra parameters \& training, and generalize poorly to new dataset distributions. To mitigate this, we propose leveraging a recently developed constrained generation model Neurological Decoding (NLD) as an alternative to current QFS regimes which rely on additional sub-architectures and training. We first construct lexical constraints by identifying important tokens from the document using a lightweight gradient attribution model, then subsequently force the generated summary to satisfy these constraints by directly manipulating the final vocabulary likelihood. This lightweight approach requires no additional parameters or finetuning as it utilizes both an off-the-shelf neural retrieval model to construct the constraints and a standard generative language model to produce the QFS. We demonstrate the efficacy of this approach on two public QFS collections achieving near parity with the state-of-the-art model with substantially reduced complexity.

翻译：查询聚焦摘要旨在提供满足给定查询信息需求的文档摘要，在抽象片段生成等多项信息检索应用中具有实用价值。当前方法通常需将额外信息（如查询-答案相关性或查询与文档间的细粒度标记级交互）注入微调后的大语言模型。然而这些方法常需额外参数与训练，且难以泛化至新数据集分布。为此，我们提出利用近期开发的约束生成模型Neurological Decoding作为替代方案，摒弃依赖附加子架构与训练的现有方法。我们首先通过轻量级梯度归因模型从文档中识别关键标记构建词汇约束，继而通过直接操纵最终词汇概率强制生成摘要满足这些约束。该轻量方法无需额外参数或微调，因其同时利用现成神经检索模型构建约束和标准生成型语言模型进行查询聚焦摘要。我们在两个公开查询聚焦摘要数据集上验证了方法的有效性，以显著降低的复杂度实现了与最先进模型近乎相当的性能。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/