Phonetics is the scientific field concerned with the study of how speech is produced, heard and perceived. It abounds with data, such as acoustic speech recordings, neuroimaging data, or articulatory data. In this paper, we provide an introduction to different areas of phonetics (acoustic phonetics, sociophonetics, speech perception, articulatory phonetics, speech inversion, sound change, and speech technology), an overview of the statistical methods for analyzing their data, and an introduction to the signal processing methods commonly applied to speech recordings. A major transition in the statistical modeling of phonetic data has been the shift from fixed effects to random effects regression models, the modeling of curve data (for instance via GAMMs or FDA methods), and the use of Bayesian methods. This shift has been driven in part by the increased focus on large speech corpora in phonetics, which has been driven by machine learning methods such as forced alignment. We conclude by identifying opportunities for future research.
翻译:语音学是研究言语如何产生、听觉如何接收以及如何被感知的科学领域。该领域充斥着大量数据,例如语音声学录音、神经影像数据或发音数据。本文介绍了语音学的不同分支(声学语音学、社会语音学、言语感知、发音语音学、语音反演、音变及语音技术),概述了分析这些数据的统计方法,并介绍了常用于语音录音的信号处理方法。语音数据统计建模的一个重要转变体现在:从固定效应回归模型转向随机效应回归模型、曲线数据建模(例如通过GAMMs或FDA方法)以及贝叶斯方法的应用。这一转变部分源于语音学对大规模语音语料库研究的日益重视,而诸如强制对齐等机器学习方法推动了这一趋势。最后,本文指出了未来研究的潜在方向。