We introduce Edify Image, a family of diffusion models capable of generating photorealistic image content with pixel-perfect accuracy. Edify Image uses cascaded pixel-space diffusion models trained with a novel Laplacian diffusion process, in which image signals at different frequency bands are attenuated at varying rates. Edify Image supports a wide range of applications, including text-to-image synthesis, 4K upsampling, ControlNets, 360° HDR panorama generation, and finetuning for image customization.
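To make the idea of band-dependent attenuation concrete, the following is a minimal NumPy sketch, not the Edify Image implementation. It splits an image into two Laplacian-pyramid-style bands and scales each by its own factor before adding Gaussian noise. The two-band split, the exponential attenuation rates `rate_low` and `rate_high`, the linear noise level `sigma = t`, and all function names are assumptions chosen for illustration; the abstract does not specify them.

```python
import numpy as np

def downsample(x):
    """Average-pool a (H, W) array by 2 in each dimension (H, W even)."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def upsample(x):
    """Nearest-neighbour upsample a (H, W) array by 2 in each dimension."""
    return np.repeat(np.repeat(x, 2, axis=0), 2, axis=1)

def laplacian_bands(x):
    """Split an image into a low-frequency band and a high-frequency residual."""
    low = upsample(downsample(x))
    high = x - low  # residual, so low + high reconstructs x exactly
    return low, high

def noisy_sample(x, t, rng, rate_low=1.0, rate_high=4.0):
    """Attenuate each band at its own rate, then add Gaussian noise.

    t lies in [0, 1]. The exponential schedules and sigma = t are
    hypothetical choices, not taken from the source.
    """
    low, high = laplacian_bands(x)
    a_low = np.exp(-rate_low * t)    # low frequencies decay slowly
    a_high = np.exp(-rate_high * t)  # high frequencies decay quickly
    sigma = t
    noisy = a_low * low + a_high * high + sigma * rng.standard_normal(x.shape)
    return noisy, (a_low, a_high)
```

With these assumed rates, fine detail is attenuated well before coarse structure as `t` grows, which is the qualitative behaviour the abstract describes; a cascade could then assign coarse bands to a low-resolution model and fine bands to higher-resolution stages.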