Existing 3D editing methods rely on computationally intensive scene-by-scene iterative optimization and suffer from multi-view inconsistency. We propose an effective and feed-forward 3D editing framework based on the TRELLIS generative backbone, capable of modifying 3D models from a single editing view. Our framework addresses two key issues: adapting training-free 2D editing to structured 3D representations, and overcoming the bottleneck of appearance fidelity in compressed 3D features. To ensure geometric consistency, we introduce Voxel FlowEdit, an edit-driven flow in the sparse voxel latent space that achieves globally consistent 3D deformation in a single pass. To restore high-fidelity details, we develop a normal-guided single to multi-view generation module as an external appearance prior, successfully recovering high-frequency textures. Experiments demonstrate that our method enables fast, globally consistent, and high-fidelity 3D model editing.
翻译:现有三维编辑方法依赖逐场景迭代优化,计算开销大且存在多视角不一致问题。本文提出一种基于TRELLIS生成骨架的高效前馈式三维编辑框架,仅需单个编辑视角即可修改三维模型。该框架解决两个关键问题:将免训练二维编辑适配至结构化三维表示,以及突破压缩三维特征中外观保真度的瓶颈。为确保几何一致性,我们提出Voxel FlowEdit——在稀疏体素隐空间中的编辑驱动流,可单次实现全局一致的三维变形。为恢复高保真细节,我们开发了法线引导的单视角到多视角生成模块作为外部外观先验,成功复原高频纹理。实验表明,本方法能实现快速、全局一致且高保真度的三维模型编辑。