We introduce ReplaceAnything3D model (RAM3D), a novel text-guided 3D scene editing method that enables the replacement of specific objects within a scene. Given multi-view images of a scene, a text prompt describing the object to replace, and a text prompt describing the new object, our Erase-and-Replace approach can effectively swap objects in the scene with newly generated content while maintaining 3D consistency across multiple viewpoints. We demonstrate the versatility of ReplaceAnything3D by applying it to various realistic 3D scenes, showcasing results of modified foreground objects that are well-integrated with the rest of the scene without affecting its overall integrity.
翻译:我们提出ReplaceAnything3D模型(RAM3D),一种新颖的文本引导的3D场景编辑方法,能够实现场景中特定对象的替换。给定场景的多视角图像、描述待替换对象的文本提示以及描述新对象的文本提示,我们的"擦除与替换"方法可有效将场景中的对象替换为新生成的内容,同时保持多视角下的3D一致性。通过将ReplaceAnything3D应用于多种真实3D场景,我们展示了该方法的通用性。实验结果显示,修改后的前景对象能够与场景其余部分良好融合,且不影响场景的整体完整性。