Maintaining engagement in immersive meetings is challenging, particularly when users must catch up on missed content after disruptions. While transcription interfaces can help, table-fixed panels have the potential to distract users from the group, diminishing social presence, while avatar-fixed captions fail to provide past context. We present EngageSync, a context-aware avatar-fixed transcription interface that adapts based on user engagement, offering live transcriptions and LLM-generated summaries to enhance catching up while preserving social presence. We implemented a live VR meeting setup for a 12-participant formative study and elicited design considerations. In two user studies with small (3 avatars) and mid-sized (7 avatars) groups, EngageSync significantly improved social presence (p < .05) and time spent gazing at others in the group instead of the interface over table-fixed panels. Also, it reduced re-engagement time and increased information recall (p < .05) over avatar-fixed interfaces, with stronger effects in mid-sized groups (p < .01).
翻译:在沉浸式会议中保持参与度具有挑战性,特别是当用户在受到干扰后必须补上错过的内容时。虽然转录界面可以提供帮助,但固定在桌面上的面板可能会使用户从小组中分心,削弱社会临场感,而固定在虚拟形象上的字幕则无法提供过往情境。我们提出了EngageSync,一种基于用户参与度自适应调整的情境感知型虚拟形象固定转录界面,它提供实时转录和LLM生成的摘要,以在保持社会临场感的同时增强补课效果。我们实现了一个用于12人形成性研究的实时VR会议设置,并得出了设计考量。在小型(3个虚拟形象)和中型(7个虚拟形象)群体的两项用户研究中,与固定在桌面上的面板相比,EngageSync显著提高了社会临场感(p < .05),并增加了用户注视组内其他成员而非界面的时间。此外,与固定在虚拟形象上的界面相比,它减少了重新投入时间并提高了信息回忆率(p < .05),且在中型群体中效果更强(p < .01)。