Embodiment can enhance conversational agents, such as increasing their perceived presence. This is typically achieved through visual representations of a virtual body; however, visual modalities are not always available, such as when users interact with agents using headphones or display-less glasses. In this work, we explore auditory embodiment. By introducing auditory cues of bodily presence - through spatially localized voice and situated Foley audio from environmental interactions - we investigate how audio alone can convey embodiment and influence perceptions of a conversational agent. We conducted a 2 (spatialization: monaural vs. spatialized) x 2 (Foley: none vs. Foley) within-subjects study, where participants (n=24) engaged in conversations with agents. Our results show that spatialization and Foley increase co-presence, but reduce users' perceptions of the agent's attention and other social attributes.
翻译:具身化能够增强对话代理,例如提升其感知临场感。这通常通过虚拟身体的视觉表征实现;然而,视觉模态并非总是可用,例如当用户通过耳机或无显示屏眼镜与代理交互时。在本研究中,我们探索听觉具身化。通过引入身体存在的听觉线索——包括空间定位的语音和环境交互产生的情境拟音——我们探究仅凭音频如何传递具身化并影响对对话代理的感知。我们开展了一项2(空间化:单声道 vs. 空间化)×2(拟音:无 vs. 有)的被试内研究,参与者(n=24)与代理进行对话。结果显示,空间化与拟音能增强共在感,但会降低用户对代理注意力及其他社会属性的感知。