Atypical speech is receiving greater attention in speech technology research, but much of this work unfolds with limited interdisciplinary dialogue. For stuttered speech in particular, it is widely recognised that current speech recognition systems fall short in practice, and current evaluation methods and research priorities are not systematically grounded in end-user experiences and needs. In this work, we analyse these gaps through 1) a scoping review of papers that deal with stuttered speech and 2) a survey of 70 stakeholders, including adults who stutter and speech-language pathologists. By analysing these two perspectives, we propose a taxonomy of stuttered-speech research, identify where current research directions diverge from the needs articulated by stakeholders, and conclude by outlining concrete guidelines and directions towards addressing the real needs of the stuttering community.
翻译:非典型语音在语音技术研究中日益受到关注,但大部分相关工作缺乏跨学科对话。就口吃语音而言,现有语音识别系统在实际应用中表现不足已是共识,当前评估方法和研究重点尚未系统性地建立在终端用户体验和需求的基础上。本研究通过以下两方面分析这些差距:1)针对口吃语音相关论文的范围综述;2)面向70位利益相关者(包括成年口吃者和语言病理学家)的调查。通过分析这两种视角,我们提出口吃语音研究的分类体系,揭示当前研究方向与利益相关者明确需求之间的分歧,并最终提出应对口吃群体实际需求的具体指导方针与方向。