Multi-view image generation attracts particular attention these days due to its promising 3D-related applications, e.g., image viewpoint editing. Most existing methods follow a paradigm where a 3D representation is first synthesized, and then rendered into 2D images to ensure photo-consistency across viewpoints. However, such explicit bias for photo-consistency sacrifices photo-realism, causing geometry artifacts and loss of fine-scale details when these methods are applied to edit real images. To address this issue, we propose ray conditioning, a geometry-free alternative that relaxes the photo-consistency constraint. Our method generates multi-view images by conditioning a 2D GAN on a light field prior. With explicit viewpoint control, state-of-the-art photo-realism and identity consistency, our method is particularly suited for the viewpoint editing task.
翻译:多视点图像生成因其在三维相关应用(如图像视角编辑)中的潜力而备受关注。现有方法大多遵循一种先合成三维表征、再渲染为二维图像的范式,以确保跨视角的光度一致性。然而,这种对光度一致性的显式偏好牺牲了照片真实感,导致将此类方法应用于真实图像编辑时出现几何伪影和精细细节损失。为解决此问题,我们提出射线条件——一种无需几何信息的替代方案,可放宽光度一致性约束。该方法通过以光场先验为条件约束二维生成对抗网络来生成多视点图像。凭借显式视角控制、领先的照片真实感与身份一致性,我们的方法特别适用于视角编辑任务。