A comparative study of Grid and Natural sentences effects on Normal-to-Lombard conversion

Grid sentence is commonly used for studying the Lombard effect and Normal-to-Lombard conversion. However, it's unclear if Normal-to-Lombard models trained on grid sentences are sufficient for improving natural speech intelligibility in real-world applications. This paper presents the recording of a parallel Lombard corpus (called Lombard Chinese TIMIT, LCT) extracting natural sentences from Chinese TIMIT. Then We compare natural and grid sentences in terms of Lombard effect and Normal-to-Lombard conversion using LCT and Enhanced MAndarin Lombard Grid corpus (EMALG). Through a parametric analysis of the Lombard effect, We find that as the noise level increases, both natural sentences and grid sentences exhibit similar changes in parameters, but in terms of the increase of the alpha ratio, grid sentences show a greater increase. Following a subjective intelligibility assessment across genders and Signal-to-Noise Ratios, the StarGAN model trained on EMALG consistently outperforms the model trained on LCT in terms of improving intelligibility. This superior performance may be attributed to EMALG's larger alpha ratio increase from normal to Lombard speech.

翻译：网格句常用于研究伦巴第效应以及正常到伦巴第的语音转换。然而，目前尚不清楚基于网格句训练的正常到伦巴第转换模型能否有效提升真实场景中自然语音的可懂度。本文首先录制了一个并行伦巴第语料库（命名为伦巴第汉语TIMIT，LCT），该语料库从汉语TIMIT中提取自然语句。随后，利用LCT和增强型普通话伦巴第网格语料库（EMALG），我们从伦巴第效应和正常到伦巴第转换两个维度对自然语句与网格句进行了比较。通过伦巴第效应的参数分析，我们发现随着噪声水平增加，自然语句与网格句在参数变化上呈现相似趋势，但网格句在α比率增幅方面表现得更为显著。基于跨性别与不同信噪比条件下的主观可懂度评估，采用EMALG训练的StarGAN模型在提升可懂度方面始终优于基于LCT训练的StarGAN模型。这一优越性能可能归因于EMALG在从正常语音到伦巴第语音转换过程中α比率增幅更大。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日