The International Phonetic Alphabet (IPA) serves to systematize phonemes in language, enabling precise textual representation of pronunciation. In Bengali phonology and phonetics, ongoing scholarly deliberations persist concerning the IPA standard and core Bengali phonemes. This work examines prior research, identifies current and potential issues, and suggests a framework for a Bengali IPA standard, facilitating linguistic analysis and NLP resource creation and downstream technology development. In this work, we present a comprehensive study of Bengali IPA transcription and introduce a novel IPA transcription framework incorporating a novel dataset with DL-based benchmarks.
翻译:国际音标(IPA)用于系统化语言中的音位,能够精确地以文本形式呈现发音。在孟加拉语音系学和语音学领域,关于IPA标准及核心孟加拉语音位的学术讨论仍在持续。本文梳理了既有研究,识别了当前及潜在问题,并提出了一套孟加拉语IPA标准框架,以促进语言分析、自然语言处理资源构建及下游技术开发。本研究对孟加拉语IPA转写进行了全面分析,并引入了一种新颖的IPA转写框架,该框架整合了新型数据集与基于深度学习的基准评估。