《不止相位：创建相位对齐的周边视觉同色异谱图像》 (It's Not Just a Phase: Creating Phase-Aligned Peripheral Metamers)

Novel display technologies can deliver high-quality images across a wide field of view, creating immersive experiences. While rendering for such devices is expensive, most of the content falls into peripheral vision, where human perception differs from that in the fovea. Consequently, it is critical to understand and leverage the limitations of visual perception to enable efficient rendering. A standard approach is to exploit the reduced sensitivity to spatial details in the periphery by reducing rendering resolution, so-called foveated rendering. While this strategy avoids rendering part of the content altogether, an alternative promising direction is to replace accurate and expensive rendering with inexpensive synthesis of content that is perceptually indistinguishable from the ground-truth image. In this paper, we propose such a method for the efficient generation of an image signal that substitutes the rendering of high-frequency details. The method is grounded in findings from image statistics, which show that preserving appropriate local statistics is critical for perceived image quality. Based on this insight, we extrapolate several local image statistics from foveated content into higher spatial frequency ranges that are attenuated or omitted in the rendering process. This rich set of statistics is later used to synthesize a signal that is added to the initial rendering, boosting its perceived quality. We focus on phase information, demonstrating the importance of its alignment across space and frequencies. We calibrate and compare our method with state-of-the-art strategies, showing a significant reduction in the content that must be accurately rendered at a relatively small extra cost for synthesizing the additional signal.

翻译：新型显示技术能够在宽广视场内提供高质量图像，从而创造沉浸式体验。尽管为此类设备进行渲染计算成本高昂，但大部分内容都落入周边视觉区域，而人类在该区域的感知与中央凹区域存在差异。因此，理解和利用视觉感知的局限性对于实现高效渲染至关重要。标准方法是通过降低渲染分辨率来利用周边视觉对空间细节敏感度的降低，即所谓的注视点渲染。虽然该策略完全避免了部分内容的渲染，但另一种有前景的方向是用廉价合成的内容替代精确且昂贵的渲染，这些合成内容在感知上与真实图像无法区分。本文提出了一种高效生成图像信号的方法，用于替代高频细节的渲染。该方法基于图像统计学的发现，即保持适当的局部统计特性对感知图像质量至关重要。基于这一洞见，我们从注视点内容中推断出若干局部图像统计量，并将其外推至渲染过程中被衰减或忽略的更高空间频率范围。随后利用这组丰富的统计量合成信号，将其叠加到初始渲染结果上以提升感知质量。我们聚焦于相位信息，论证了其在空间和频率维度上保持对齐的重要性。通过对我们的方法与前沿策略进行校准和比较，结果表明在合成额外信号所需成本相对较低的情况下，需要精确渲染的内容量实现了显著减少。