Video matting has broad applications, from adding interesting effects to casually captured movies to assisting video production professionals. Matting with associated effects such as shadows and reflections has also attracted increasing research activity, and methods like Omnimatte have been proposed to separate dynamic foreground objects of interest into their own layers. However, prior works represent video backgrounds as 2D image layers, limiting their capacity to express more complicated scenes, thus hindering application to real-world videos. In this paper, we propose a novel video matting method, OmnimatteRF, that combines dynamic 2D foreground layers and a 3D background model. The 2D layers preserve the details of the subjects, while the 3D background robustly reconstructs scenes in real-world videos. Extensive experiments demonstrate that our method reconstructs scenes with better quality on various videos.
翻译:视频抠像具有广泛的应用,从为随意拍摄的电影添加有趣特效到辅助视频制作专业人员。带有阴影、反射等相关特效的抠像也吸引了越来越多研究活动,诸如Omnimatte等方法已被提出,用于将感兴趣的动态前景对象分离到各自的图层中。然而,先前的工作将视频背景表示为2D图像图层,限制了其表达更复杂场景的能力,从而阻碍了在现实世界视频中的应用。本文提出一种新颖的视频抠像方法OmnimatteRF,该方法结合了动态2D前景图层和3D背景模型。2D图层保留了主体的细节,而3D背景则鲁棒地重建了现实视频中的场景。大量实验表明,我们的方法在各种视频上能以更高质量重建场景。