We propose an objective intelligibility measure (OIM), called the Gammachirp Envelope Similarity Index (GESI), which can predict the speech intelligibility (SI) of simulated hearing loss (HL) sounds for normal hearing (NH) listeners. GESI is an intrusive method that computes the SI metric using the gammachirp filterbank (GCFB), the modulation filterbank, and the extended cosine similarity measure. The unique features of GESI are that i) it reflects the hearing impaired (HI) listener's HL that appears in the audiogram and is caused by active and passive cochlear dysfunction, ii) it provides a single goodness metric, as in the widely used STOI and ESTOI, that can be used immediately to evaluate SE algorithms, and iii) it provides a simple control parameter to accept the level asymmetry of the reference and test sounds and to deal with individual listening conditions and environments. We evaluated GESI and the conventional OIMs, STOI, ESTOI, MBSTOI, and HASPI versions 1 and 2 by using four SI experiments on words of male and female speech sounds in both laboratory and remote environments. GESI was shown to outperform the other OIMs in the evaluations. GESI could be used to improve SE algorithms in assistive listening devices for individual HI listeners.
翻译:摘要:我们提出了一种客观可懂度度量(OIM),称为Gammachirp包络相似性指数(GESI),该指标能够预测正常听力(NH)听者在模拟听力损失(HL)声音下的语音可懂度(SI)。GESI是一种侵入式方法,通过使用gammachirp滤波器组(GCFB)、调制滤波器组和扩展余弦相似度度量来计算SI指标。GESI的独特特征包括:i)它反映了听障(HI)听者听力图显示的、由主动和被动耳蜗功能障碍引起的听力损失;ii)它提供了类似广泛使用的STOI和ESTOI的单一优度指标,可立即用于评估语音增强(SE)算法;iii)它提供了一个简单的控制参数,以接受参考信号和测试信号的电平不对称性,并适应个体听音条件及环境。我们通过四项关于男性和女性语音词汇的SI实验,在实验室和远程环境中评估了GESI与传统OIM(包括STOI、ESTOI、MBSTOI以及HASPI版本1和2)的表现。结果表明,GESI在评估中优于其他OIM。GESI可用于改善助听设备中针对个体HI听者的SE算法。