We proposed a new objective intelligibility measure (OIM), called the Gammachirp Envelope Similarity Index (GESI), which can predict the speech intelligibility (SI) of simulated hearing loss (HL) sounds for normal hearing (NH) listeners. GESI is an intrusive method that computes the SI metric using the gammachirp filterbank (GCFB), the modulation filterbank, and the extended cosine similarity measure. The unique features of GESI are i) to reflect the hearing impaired (HI) listener's HL that appears in the audiogram and is caused by active and passive cochlear dysfunction, ii) to provide a single goodness metric, as in the widely used STOI and ESTOI, that can be used immediately to evaluate SE algorithms, and iii) to provide a simple control parameter to accept the level asymmetry of the reference and test sounds and to deal with individual listening conditions and environments. For evaluation, we conducted four SI experiments on words of male and female speech sounds in both laboratory and remote environments. We then evaluated GESI and the conventional OIMs, STOI, ESTOI, MBSTOI, and HASPI versions 1 and 2, in three test schemes for their ability to predict mean and individual SI values with and without the use of simulated HL sounds. GESI was shown to outperform the other OIMs in the evaluations. GESI could be used to improve SE algorithms in assistive listening devices for individual HI listeners.
翻译:摘要:我们提出了一种名为伽马啁啾包络相似性指数(GESI)的新型客观可懂度度量(OIM),能够预测正常听力(NH)听众对模拟听力损失(HL)声音的言语可懂度(SI)。GESI是一种侵入式方法,通过伽马啁啾滤波器组(GCFB)、调制滤波器组以及扩展余弦相似度度量来计算SI指标。GESI的独特特征包括:i) 反映听力受损(HI)听众在听力图中表现出的、由主动与被动耳蜗功能障碍引起的听力损失;ii) 如同广泛使用的STOI和ESTOI一样,提供可直接用于评估语音增强(SE)算法的单一优度指标;iii) 提供简单的控制参数以接受参考信号与测试信号的幅度不对称性,并适应个体聆听条件与环境。为进行评估,我们在实验室和远程环境中对男声和女声单词进行了四项SI实验。随后,在三种测试方案下,我们评估了GESI及传统OIM(STOI、ESTOI、MBSTOI、HASPI v1和v2)预测使用与未使用模拟HL声音时的平均及个体SI值的能力。评估表明,GESI优于其他OIM。GESI可被用于改进面向个体HI听众的辅助聆听设备中的SE算法。