The growing prevalence of high-resolution displays on edge devices has created a pressing need for efficient high dynamic range (HDR) imaging algorithms. However, most existing HDR methods either struggle to deliver satisfactory visual quality or incur high computational and memory costs, limiting their applicability to high-resolution inputs (typically exceeding 12 megapixels). Furthermore, current HDR dataset collection approaches are often labor-intensive and inefficient. In this work, we explore a novel and practical solution for HDR reconstruction directly from raw sensor data, aiming to enhance both performance and deployability on mobile platforms. Our key insights are threefold: (1) we propose RepUNet, a lightweight and efficient HDR network leveraging structural re-parameterization for fast and robust inference; (2) we design a new computational raw HDR data formation pipeline and construct a new raw HDR dataset, RealRaw-HDR; (3) we design a plug-and-play motion alignment loss to suppress ghosting artifacts under constrained bandwidth conditions effectively. Our model contains fewer than 830K parameters and takes less than 3 ms to process an image of 4K resolution using one RTX 3090 GPU. While being highly efficient, our model also achieves comparable performance to state-of-the-art HDR methods in terms of PSNR, SSIM, and a color difference metric.
翻译:随着边缘设备上高分辨率显示器的日益普及,对高效高动态范围成像算法的需求变得尤为迫切。然而,现有的大多数高动态范围方法要么难以提供令人满意的视觉质量,要么产生高昂的计算和内存开销,这限制了它们对高分辨率输入(通常超过1200万像素)的适用性。此外,当前的高动态范围数据集采集方法通常劳动密集且效率低下。在本工作中,我们探索了一种直接从原始传感器数据进行高动态范围重建的新颖且实用的解决方案,旨在提升移动平台上的性能和部署能力。我们的核心见解有三点:(1)我们提出了RepUNet,一个轻量级且高效的高动态范围网络,利用结构重参数化实现快速且鲁棒的推理;(2)我们设计了一种新的计算式原始高动态范围数据生成流程,并构建了一个新的原始高动态范围数据集RealRaw-HDR;(3)我们设计了一种即插即用的运动对齐损失函数,以在受限带宽条件下有效抑制重影伪影。我们的模型包含少于83万个参数,使用一块RTX 3090 GPU处理一张4K分辨率图像耗时不到3毫秒。在保持高效率的同时,我们的模型在峰值信噪比、结构相似性指数和色差度量方面也达到了与最先进高动态范围方法相当的性能。