Video Copy Detection (VCD) has been developed to identify instances of unauthorized or duplicated video content. This paper presents our second place solutions to the Meta AI Video Similarity Challenge (VSC22), CVPR 2023. In order to compete in this challenge, we propose Feature-Compatible Progressive Learning (FCPL) for VCD. FCPL trains various models that produce mutually-compatible features, meaning that the features derived from multiple distinct models can be directly compared with one another. We find this mutual compatibility enables feature ensemble. By implementing progressive learning and utilizing labeled ground truth pairs, we effectively gradually enhance performance. Experimental results demonstrate the superiority of the proposed FCPL over other competitors. Our code is available at https://github.com/WangWenhao0716/VSC-DescriptorTrack-Submission and https://github.com/WangWenhao0716/VSC-MatchingTrack-Submission.
翻译:视频拷贝检测(VCD)技术旨在识别未经授权或重复的视频内容。本文介绍了我们在CVPR 2023 Meta AI视频相似性挑战赛(VSC22)中获得第二名的解决方案。为应对该挑战,我们提出了面向VCD的特征兼容渐进式学习(FCPL)方法。FCPL训练生成相互兼容特征的多种模型,即来自不同模型的特征可直接进行相互比较。我们发现这种互兼容性能够实现特征集成。通过实施渐进式学习并利用标注的真值对,我们有效逐步提升了性能。实验结果表明,所提出的FCPL方法在与其他竞争方案的比较中展现出优越性。我们的代码已开源至https://github.com/WangWenhao0716/VSC-DescriptorTrack-Submission和https://github.com/WangWenhao0716/VSC-MatchingTrack-Submission。