Mobile messaging apps offer an increasing range of emotional expressions, such as emojis to help users manually augment their texting experiences. Accessibility of such augmentations is limited in voice messaging. With the term "speejis" we refer to accessible emojis and other visual speech emotion cues that are created automatically from speech input alone. The paper presents an implementation of speejis and reports on a user study (N=12) comparing the UX of voice messaging with and without speejis. Results show significant differences in measures such as attractiveness and stimulation and a clear preference of all participants for messaging with speejis. We highlight the benefits of using paralinguistic speech processing and continuous emotion models to enable finer grained augmentations of emotion changes and transitions within a single message in addition to augmentations of the overall tone of the message.
翻译:移动消息应用提供了日益丰富的情感表达方式,例如表情符号,以帮助用户手动增强其文本交流体验。然而,在语音消息中,此类增强功能的可及性有限。我们提出“speejis”这一术语,指代仅通过语音输入自动生成的可及性表情符号及其他视觉语音情感线索。本文介绍了一种speejis的实现方案,并报告了一项用户研究(N=12),比较了使用与不使用speejis时语音消息的用户体验。结果显示,在吸引力和刺激性等指标上存在显著差异,且所有参与者都明确偏好使用speejis的消息方式。我们强调了利用副语言语音处理和连续情感模型的优势,除了能增强消息的整体语调外,还能实现对单条消息内部情感变化与过渡的更细粒度增强。