Audio has become an increasingly crucial biometric modality due to its ability to provide an intuitive way for humans to interact with machines. It is currently being used for a range of applications, including person authentication to banking to virtual assistants. Research has shown that these systems are also susceptible to spoofing and attacks. Therefore, protecting audio processing systems against fraudulent activities, such as identity theft, financial fraud, and spreading misinformation, is of paramount importance. This paper reviews the current state-of-the-art techniques for detecting audio spoofing and discusses the current challenges along with open research problems. The paper further highlights the importance of considering the ethical and privacy implications of audio spoofing detection systems. Lastly, the work aims to accentuate the need for building more robust and generalizable methods, the integration of automatic speaker verification and countermeasure systems, and better evaluation protocols.
翻译:音频因其为人类与机器交互提供直观方式而日益成为一种关键的生物特征模态,目前已被广泛应用于从个人身份认证、银行服务到虚拟助手等多种场景。研究表明,这些系统同样易受到伪造和攻击。因此,保护音频处理系统免受身份盗窃、金融欺诈和虚假信息传播等欺诈行为的侵害至关重要。本文综述了当前用于检测音频伪造的最先进技术,并讨论了现有挑战及开放研究问题。文章进一步强调了考虑音频伪造检测系统的伦理和隐私影响的重要性。最后,本文旨在强调构建更鲁棒和泛化能力更强的方法、集成自动说话人验证与反制系统以及改进评估协议的必要性。