Large language models (LLMs) have demonstrated remarkable performance in abstractive summarization tasks. However, their ability to precisely control summary attributes (e.g., length or topic) remains underexplored, limiting their adaptability to specific user preferences. In this paper, we systematically explore the controllability of LLMs. To this end, we revisit summary attribute measurements and introduce two iterative evaluation metrics, failure rate and average iteration count, to precisely evaluate the controllability of LLMs rather than merely measuring errors. Our findings show that LLMs struggle more with numerical attributes than with linguistic attributes. To address this challenge, we propose a guide-to-explain framework (GTE) for controllable summarization. GTE enables the model to identify misaligned attributes in its initial draft and guides it to explain the errors in its previous output. By reflecting on this misalignment, the model generates well-adjusted summaries that reliably satisfy the desired attributes, while requiring surprisingly few iterations compared with other iterative approaches.
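To make the two iterative metrics concrete, the following is a minimal Python sketch, not the authors' implementation: the function names (`iterate_until_aligned`, `revise`, `meets_attribute`) are hypothetical stand-ins, with `revise` representing one guide-and-explain round. A summary is revised until it satisfies the requested attribute or an iteration budget is exhausted, and failure rate and average iteration count are aggregated over the per-document outcomes.

```python
from typing import Callable, List, Optional


def iterate_until_aligned(
    draft: str,
    meets_attribute: Callable[[str], bool],
    revise: Callable[[str], str],
    max_iters: int = 5,
) -> Optional[int]:
    """Return how many revision iterations the summary needed to satisfy the
    requested attribute, or None if it never does within max_iters (a failure).
    The revise() callback is a placeholder for one guide-and-explain round:
    identify the misaligned attribute, have the model explain the error,
    then regenerate the summary."""
    summary = draft
    for i in range(max_iters + 1):
        if meets_attribute(summary):
            return i  # 0 means the initial draft already satisfied the attribute
        summary = revise(summary)
    return None  # still misaligned after exhausting the revision budget


def failure_rate_and_avg_iters(outcomes: List[Optional[int]]):
    """Aggregate the two iterative metrics over a set of test documents:
    failure rate = fraction of documents never aligned within the budget;
    average iteration count = mean number of revisions among the successes."""
    failures = sum(1 for o in outcomes if o is None)
    successes = [o for o in outcomes if o is not None]
    failure_rate = failures / len(outcomes)
    avg_iters = sum(successes) / len(successes) if successes else float("nan")
    return failure_rate, avg_iters


# Toy usage with a length attribute (target: at most 20 words). The revise
# lambda below is a stand-in for an actual LLM call.
if __name__ == "__main__":
    target_len = 20
    meets = lambda s: len(s.split()) <= target_len
    revise = lambda s: " ".join(s.split()[: max(len(s.split()) - 5, 1)])
    outcomes = [iterate_until_aligned("word " * 35, meets, revise)]
    print(failure_rate_and_avg_iters(outcomes))
```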