We introduce a framework for intrinsic latent diffusion models operating directly on the surfaces of 3D shapes, with the goal of synthesizing high-quality textures. Our approach is underpinned by two contributions: field latents, a latent representation encoding textures as discrete vector fields on the mesh vertices, and field latent diffusion models, which learn to denoise a diffusion process in the learned latent space on the surface. We consider a single-textured-mesh paradigm, where our models are trained to generate variations of a given texture on a mesh. We show the synthesized textures are of superior fidelity compared those from existing single-textured-mesh generative models. Our models can also be adapted for user-controlled editing tasks such as inpainting and label-guided generation. The efficacy of our approach is due in part to the equivariance of our proposed framework under isometries, allowing our models to seamlessly reproduce details across locally similar regions and opening the door to a notion of generative texture transfer.
翻译:我们提出了一种直接在三维形状表面上运行的本征潜变量扩散模型框架,旨在合成高质量纹理。我们的方法基于两项贡献:场潜变量——一种将纹理编码为网格顶点上离散向量场的潜变量表示;以及场潜变量扩散模型——该模型学习在表面上的学习潜变量空间中对扩散过程进行去噪。我们采用单纹理网格范式,训练模型生成给定网格纹理的变体。实验表明,相较于现有单纹理网格生成模型,本方法合成的纹理具有更高的保真度。我们的模型还可适用于用户控制的编辑任务,如修复和标签引导生成。本方法的有效性部分源于所提框架在等距变换下的等变性,这使得模型能够无缝地在局部相似区域复现细节,并为生成式纹理迁移的概念开辟了道路。