This work introduces a time domain personalized method (pGTFF0) to achieve intelligibility improvement of noisy speech for Autism Spectrum Disorder (ASD) situation. For this proposal, harmonic features estimated from speech frames are considered as center frequencies of Gammatone auditory filterbanks. A gain factor is further applied to the output of the filtered samples. The key goal is the emulation of an external noise filtering tailored for individuals with ASD. A perceptual listening test demonstrates that ASD volunteers attained lower intelligibility rates than Neurotypical (NT). The proposed solution is compared to three competing approaches considering four acoustic noises at different signal-to-noise ratios. Two objective measures (ESTOI and PESQ) are also adopted for evaluation. The experimental results show that the personalized solution outperformed the competing approaches in terms of intelligibility and quality improvement.
翻译:本文提出了一种时域个性化方法(pGTFF0),旨在改善自闭症谱系障碍(ASD)情境下噪声语音的清晰度。在该方案中,从语音帧中提取的谐波特征被用作伽马通听觉滤波器组的中心频率,并对滤波样本的输出进一步施加增益因子。核心目标是模拟针对ASD个体定制的外部噪声滤波机制。感知听力测试表明,ASD志愿者的清晰度评分低于神经典型(NT)个体。将所提方案与三种对比方法进行比较,涵盖四种声学噪声及不同信噪比条件。同时采用两种客观度量(ESTOI和PESQ)进行评估。实验结果表明,该个性化方法在清晰度和质量提升方面均优于对比方案。