Intelligent interfaces increasingly use large language models to summarize user-generated content, yet these summaries emphasize what is mentioned while overlooking what is missing. This presence bias can mislead users who rely on summaries to make decisions. We present Domain Informed Summarization through Contrast (DiSCo), an expectation-based computational approach that makes absences visible by comparing each entity's content with domain topical expectations captured in reference distributions of aspects typically discussed in comparable accommodations. This comparison identifies aspects that are either unusually emphasized or missing relative to domain norms and integrates them into the generated text. In a user study across three accommodation domains, namely ski, beach, and city center, DiSCo summaries were rated as more detailed and useful for decision making than baseline large language model summaries, although slightly harder to read. The findings show that modeling expectations reduces presence bias and improves both transparency and decision support in intelligent summarization interfaces.
翻译:智能界面日益采用大型语言模型来总结用户生成的内容,但这些摘要往往强调已提及的信息,而忽略了缺失的内容。这种存在性偏差可能会误导依赖摘要进行决策的用户。我们提出了基于对比的领域感知摘要方法(DiSCo),这是一种基于期望的计算方法,通过将每个实体的内容与领域主题期望进行对比来凸显缺失信息,其中领域期望通过参考分布捕获了同类住宿中通常讨论的方面。该比较能识别出相对于领域规范被异常强调或缺失的方面,并将其整合到生成的文本中。在涵盖滑雪、海滩和市中心三个住宿领域的用户研究中,DiSCo摘要被认为比基线大型语言模型摘要更详细且对决策更有帮助,尽管可读性稍逊。研究结果表明,建模期望能够减少存在性偏差,并提升智能摘要界面的透明度与决策支持能力。