In this work, we recover the underlying 3D structure of non-geometrically consistent scenes. We focus our analysis on hand-drawn images from cartoons and anime. Many cartoons are created by artists without a 3D rendering engine, which means that any new image of a scene is hand-drawn. The hand-drawn images are usually faithful representations of the world, but only in a qualitative sense, since it is difficult for humans to draw multiple perspectives of an object or scene 3D consistently. Nevertheless, people can easily perceive 3D scenes from inconsistent inputs! In this work, we correct for 2D drawing inconsistencies to recover a plausible 3D structure such that the newly warped drawings are consistent with each other. Our pipeline consists of a user-friendly annotation tool, camera pose estimation, and image deformation to recover a dense structure. Our method warps images to obey a perspective camera model, enabling our aligned results to be plugged into novel-view synthesis reconstruction methods to experience cartoons from viewpoints never drawn before. Our project page is https://toon3d.studio/.
翻译:在本文中,我们恢复非几何一致场景的潜在三维结构,重点分析卡通和动漫中的手绘图像。许多卡通作品由艺术家在不借助三维渲染引擎的情况下创作,这意味着场景的每张新图像均为手工绘制。这些手绘图像通常以定性方式忠实反映现实世界,但人类难以从多个视角一致地绘制物体或场景的三维结构。尽管如此,人们仍能轻松从不一致的输入中感知三维场景!本文通过修正二维绘图的非一致性,恢复合理的三维结构,使得新扭曲后的绘图彼此一致。我们的流程包括用户友好的标注工具、相机姿态估计以及图像变形,以恢复稠密结构。该方法通过扭曲图像使其符合透视相机模型,从而将对齐后的结果接入新视角合成重建方法,实现从从未被绘制过的视点体验卡通场景。项目页面详见https://toon3d.studio/。