Copy-move forgery on speech (CMF), coupled with post-processing techniques, presents a great challenge to the forensic detection and localization of tampered areas. Most of the existing CMF detection approaches necessitate pre-segmentation of speech to facilitate similarity calculations among these segments. However, these approaches usually suffer from the problems of uncontrollable computational complexity and sensitivity to the presence of a word that is read multiple times within a speech recording. To address these issues, we propose a local feature tensors-based CMF detection algorithm that can transform duplicate detection and localization problems into a special tensor-matching procedure, accompanied by complete theoretical analysis as support. Through extensive experimentation, we have demonstrated that our method exhibits computational efficiency and robustness against post-processing techniques. Notably, it can effectively and blindly detect tampered segments, even those as short as a fractional second. These advantages highlight the promising potential of our approach for practical applications.
翻译:语音复制-移动伪造(CMF)结合后处理技术,对篡改区域的司法检测与定位构成了巨大挑战。现有大多数CMF检测方法需预先对语音进行分割,以促进这些片段间的相似性计算。然而,这些方法通常面临计算复杂度不可控以及对语音录音中多次朗读的单词敏感等问题。为解决上述问题,我们提出一种基于局部特征张量的CMF检测算法,该算法可将重复检测与定位问题转化为特殊的张量匹配过程,并辅以完整的理论分析作为支撑。通过大量实验证明,我们的方法在计算效率和对后处理技术的鲁棒性方面表现优异。值得注意的是,该方法能够有效且盲目地检测出篡改片段,即使是仅有零点几秒的短片段。这些优势凸显了该方法在实践应用中具有广阔前景。