We propose a new framework for creating and easily manipulating 3D models of arbitrary objects using casually captured videos. Our core ingredient is a novel hierarchy deformation model, which captures motions of objects with a tree-structured bones. Our hierarchy system decomposes motions based on the granularity and reveals the correlations between parts without exploiting any prior structural knowledge. We further propose to regularize the bones to be positioned at the basis of motions, centers of parts, sufficiently covering related surfaces of the part. This is achieved by our bone occupancy function, which identifies whether a given 3D point is placed within the bone. Coupling the proposed components, our framework offers several clear advantages: (1) users can obtain animatable 3D models of the arbitrary objects in improved quality from their casual videos, (2) users can manipulate 3D models in an intuitive manner with minimal costs, and (3) users can interactively add or delete control points as necessary. The experimental results demonstrate the efficacy of our framework on diverse instances, in reconstruction quality, interpretability and easier manipulation. Our code is available at https://github.com/subin6/HSNB.
翻译:我们提出了一种新框架,用于利用随意拍摄的视频创建并轻松操控任意物体的三维模型。我们的核心组件是一种新颖的层次化变形模型,该模型通过树状结构的骨骼捕捉物体的运动。我们的层次系统基于运动粒度进行分解,并在无需利用任何先验结构知识的情况下揭示部件之间的关联。我们进一步提出对骨骼进行正则化,使其定位于运动的基础、部件的中心,并充分覆盖部件相关的表面区域。这是通过我们提出的骨骼占据函数实现的,该函数可判断给定的三维点是否位于骨骼内部。通过耦合所提出的组件,我们的框架具有以下明显优势:(1) 用户可从其随意拍摄的视频中以更高的质量获得任意物体的可动画三维模型;(2) 用户能够以直观的方式、以极低的成本操控三维模型;(3) 用户可根据需要交互式地添加或删除控制点。实验结果证明了我们的框架在多种实例上的有效性,体现在重建质量、可解释性以及更便捷的操控性方面。我们的代码公开于 https://github.com/subin6/HSNB。