We introduce a framework for intrinsic latent diffusion models operating directly on the surfaces of 3D shapes, with the goal of synthesizing high-quality textures. Our approach is underpinned by two contributions: field latents, a latent representation encoding textures as discrete vector fields on the mesh vertices, and field latent diffusion models, which learn to denoise a diffusion process in the learned latent space on the surface. We consider a single-textured-mesh paradigm, where our models are trained to generate variations of a given texture on a mesh. We show the synthesized textures are of superior fidelity compared those from existing single-textured-mesh generative models. Our models can also be adapted for user-controlled editing tasks such as inpainting and label-guided generation. The efficacy of our approach is due in part to the equivariance of our proposed framework under isometries, allowing our models to seamlessly reproduce details across locally similar regions and opening the door to a notion of generative texture transfer.
翻译:我们提出了一种直接在三维形状表面进行内在潜变量扩散的框架,旨在合成高质量纹理。该方法基于两项贡献:场潜变量——一种将纹理编码为网格顶点上离散向量场的潜变量表示,以及场潜变量扩散模型——通过学习在曲面上学习到的潜空间中对扩散过程进行去噪。我们采用单纹理网格范式,训练模型生成网格上给定纹理的变体。实验表明,与现有单纹理网格生成模型相比,本方法合成的纹理具有更优的保真度。该模型还可适配用户可控的编辑任务,如图像修复和标签引导生成。本方法的有效性部分源于所提框架在等距变换下的等变性,使得模型能够无缝地在局部相似区域间复现细节,并开创了生成式纹理迁移的概念。