We present Image Sculpting, a new framework for editing 2D images by incorporating tools from 3D geometry and graphics. This approach differs markedly from existing methods, which are confined to 2D spaces and typically rely on textual instructions, leading to ambiguity and limited control. Image Sculpting converts 2D objects into 3D, enabling direct interaction with their 3D geometry. Post-editing, these objects are re-rendered into 2D, merging into the original image to produce high-fidelity results through a coarse-to-fine enhancement process. The framework supports precise, quantifiable, and physically-plausible editing options such as pose editing, rotation, translation, 3D composition, carving, and serial addition. It marks an initial step towards combining the creative freedom of generative models with the precision of graphics pipelines.
翻译:我们提出Image Sculpting框架,通过融合3D几何与图形学工具实现2D图像的编辑。该方法与现有局限于2D空间且通常依赖文本指令(易导致歧义和有限控制)的技术截然不同。Image Sculpting将2D物体转换为3D表示,支持对其3D几何的直接交互操作。编辑完成后,这些物体会重新渲染为2D形式,通过从粗到精的增强过程融合回原始图像以产生高保真结果。该框架支持姿态编辑、旋转、平移、3D组合、雕刻及序列添加等精确、可量化且符合物理规律的编辑选项。这标志着向融合生成模型的创作自由与图形管线的精准控制迈出了第一步。