In image editing employing diffusion models, it is crucial to preserve the reconstruction fidelity to the original image while changing its style. Although existing methods ensure reconstruction fidelity through optimization, a drawback of these is the significant amount of time required for optimization. In this paper, we propose negative-prompt inversion, a method capable of achieving equivalent reconstruction solely through forward propagation without optimization, thereby enabling ultrafast editing processes. We experimentally demonstrate that the reconstruction fidelity of our method is comparable to that of existing methods, allowing for inversion at a resolution of 512 pixels and with 50 sampling steps within approximately 5 seconds, which is more than 30 times faster than null-text inversion. Reduction of the computation time by the proposed method further allows us to use a larger number of sampling steps in diffusion models to improve the reconstruction fidelity with a moderate increase in computation time.
翻译:在使用扩散模型进行图像编辑时,在改变图像风格的同时保持对原始图像的重建保真度至关重要。尽管现有方法通过优化确保了重建保真度,但这些方法存在优化耗时显著的缺点。本文提出负向提示反转方法,该方法仅通过前向传播而无需优化即可实现等效重建,从而实现超快速编辑过程。我们通过实验证明,该方法的重建保真度与现有方法相当,能够在约5秒内完成512像素分辨率、50采样步数的反转,其速度比空文本反转方法快30倍以上。所提方法对计算时间的缩减进一步允许我们在扩散模型中使用更多采样步数,从而以适度的计算时间增加为代价提升重建保真度。