In this paper, we examine the research conducted in the field of Nepali Automatic Speech Recognition (ASR). The primary objective of this survey is to conduct a comprehensive review of the works on Nepali Automatic Speech Recognition Systems completed to date, explore the different datasets used, examine the technology utilized, and take account of the obstacles encountered in implementing the Nepali ASR system. In tandem with the global trends of ever-increasing research on speech recognition based research, the number of Nepalese ASR-related projects are also growing. Nevertheless, the investigation of language and acoustic models of the Nepali language has not received adequate attention compared to languages that possess ample resources. In this context, we provide a framework as well as directions for future investigations.
翻译:本文系统梳理了尼泊尔语自动语音识别(ASR)领域的研究进展。本综述的首要目标是对迄今完成的尼泊尔语自动语音识别系统相关工作进行全面回顾,探究所采用的不同数据集,审查所使用的技术,并记录实施尼泊尔语ASR系统过程中遇到的障碍。与全球语音识别研究持续增长的趋势同步,尼泊尔语ASR相关项目数量也在不断增加。然而,相较于资源丰富的语言,尼泊尔语的语言模型和声学模型研究尚未获得充分关注。在此背景下,我们提出了一个研究框架及未来研究方向的指引。