We present DreamCraft3D, a hierarchical 3D content generation method that produces high-fidelity and coherent 3D objects. We tackle the problem by leveraging a 2D reference image to guide the stages of geometry sculpting and texture boosting. A central focus of this work is to address the consistency issue that existing works encounter. To sculpt geometries that render coherently, we perform score distillation sampling via a view-dependent diffusion model. This 3D prior, alongside several training strategies, prioritizes the geometry consistency but compromises the texture fidelity. We further propose Bootstrapped Score Distillation to specifically boost the texture. We train a personalized diffusion model, Dreambooth, on the augmented renderings of the scene, imbuing it with 3D knowledge of the scene being optimized. The score distillation from this 3D-aware diffusion prior provides view-consistent guidance for the scene. Notably, through an alternating optimization of the diffusion prior and 3D scene representation, we achieve mutually reinforcing improvements: the optimized 3D scene aids in training the scene-specific diffusion model, which offers increasingly view-consistent guidance for 3D optimization. The optimization is thus bootstrapped and leads to substantial texture boosting. With tailored 3D priors throughout the hierarchical generation, DreamCraft3D generates coherent 3D objects with photorealistic renderings, advancing the state-of-the-art in 3D content generation. Code available at https://github.com/deepseek-ai/DreamCraft3D.
翻译:我们提出DreamCraft3D——一种分层三维内容生成方法,能够生成高保真且连贯的三维对象。通过利用二维参考图像引导几何雕刻与纹理增强阶段,我们解决了现有方法中普遍存在的连贯性问题。为生成渲染一致的三维几何,我们采用基于视角相关扩散模型的分数蒸馏采样。这种三维先验配合多种训练策略,虽优先保证了几何连贯性,但牺牲了纹理保真度。为此,我们提出引导式分数蒸馏方法专门增强纹理效果:对场景的增强渲染结果训练个性化扩散模型Dreambooth,使其融入待优化场景的三维知识;该三维感知扩散先验的分数蒸馏为场景提供视角一致的引导。值得注意的是,通过扩散先验与三维场景表征的交替优化,我们实现了相互强化的提升——优化的三维场景有助于训练场景专用扩散模型,而该模型又能为三维优化提供更具视角一致性的引导。这种正反馈优化机制最终实现纹理效果的显著增强。通过在整个分层生成过程中定制三维先验,DreamCraft3D能够生成具有照片级渲染效果的三维连贯对象,推动了三维内容生成领域的技术发展。相关代码开源在https://github.com/deepseek-ai/DreamCraft3D。