The advent of 3D Gaussian Splatting (3DGS) has revolutionized 3D editing, offering efficient, high-fidelity rendering and enabling precise local manipulations. Currently, diffusion-based 2D editing models are harnessed to modify multi-view rendered images, which then guide the editing of 3DGS models. However, this approach faces a critical issue of multi-view inconsistency, where the guidance images exhibit significant discrepancies across views, leading to mode collapse and visual artifacts of 3DGS. To this end, we introduce View-consistent Editing (VcEdit), a novel framework that seamlessly incorporates 3DGS into image editing processes, ensuring multi-view consistency in edited guidance images and effectively mitigating mode collapse issues. VcEdit employs two innovative consistency modules: the Cross-attention Consistency Module and the Editing Consistency Module, both designed to reduce inconsistencies in edited images. By incorporating these consistency modules into an iterative pattern, VcEdit proficiently resolves the issue of multi-view inconsistency, facilitating high-quality 3DGS editing across a diverse range of scenes. Further video results are shown in http://vcedit.github.io.
翻译:三维高斯溅射(3DGS)的出现彻底改变了三维编辑领域,它提供了高效、高保真的渲染能力,并支持精确的局部操作。当前,基于扩散的二维编辑模型被用于修改多视角渲染图像,进而指导3DGS模型的编辑。然而,该方法面临一个关键的多视角不一致性问题:引导图像在不同视角间存在显著差异,导致3DGS出现模式崩溃和视觉伪影。为此,我们提出了视点一致编辑(VcEdit)这一新颖框架,该框架将3DGS无缝集成到图像编辑流程中,确保编辑后引导图像的多视角一致性,并有效缓解模式崩溃问题。VcEdit采用两个创新的一致性模块:交叉注意力一致性模块与编辑一致性模块,二者均旨在减少编辑图像中的不一致性。通过将这些一致性模块嵌入迭代式处理流程,VcEdit能够有效解决多视角不一致问题,从而在多样化场景中实现高质量的3DGS编辑。更多视频结果展示于 http://vcedit.github.io。