Common ground plays a critical role in situated spoken dialogues, where interlocutors must establish and maintain shared references to entities, events, and relations to sustain coherent interaction. For dialog systems, the ability to correctly ground conversational content in order to refer back to it later is particularly important. Prior studies have demonstrated that LLMs are capable of performing grounding acts such as requesting clarification or producing acknowledgments, yet relatively little work has investigated how common ground can be explicitly represented and stored for later use. Without such mechanisms, it remains unclear whether acknowledgment or clarification behaviors truly reflect a grounded understanding. In this work, we evaluate a model's ability to establish and exploit common ground through relational references to entities within the shared context in a situational dialogue. We test multiple methods for representing common ground in situated dialogues and further propose approaches to improve both the establishment of common ground and its subsequent use in the conversation.
翻译:共有基础在情境口语对话中扮演着关键角色,对话参与者必须建立并维持对实体、事件及关系的共享指称,以维持连贯的互动。对于对话系统而言,正确锚定对话内容以便后续引用的能力尤为重要。先前研究表明,大型语言模型能够执行诸如请求澄清或生成确认等基础行为,但关于如何显式表征和存储共有基础以供后续使用的研究相对较少。若缺乏此类机制,确认或澄清行为是否真正反映了基于基础的理解仍不明确。本研究通过情境对话中对共享语境内实体的关系指称,评估模型建立和利用共有基础的能力。我们测试了多种在情境对话中表征共有基础的方法,并进一步提出了改进共有基础建立及其在对话中后续使用的途径。