Current deep learning approaches in computer vision primarily focus on RGB data, sacrificing information. In contrast, RAW images offer a richer representation, which is crucial for precise recognition, particularly in challenging conditions such as low-light environments. The resulting demand for comprehensive RAW image datasets contrasts with the labor-intensive process of creating specific datasets for individual sensors. To address this, we propose a novel diffusion-based method for generating RAW images guided by RGB images. Our approach integrates an RGB-guidance module for feature extraction from RGB inputs and incorporates these features into the reverse diffusion process with RGB-guided residual blocks at various resolutions. This yields high-fidelity RAW images, enabling the creation of camera-specific RAW datasets. Our RGB2RAW experiments on four DSLR datasets demonstrate state-of-the-art performance. Moreover, RAW-Diffusion exhibits exceptional data efficiency, achieving strong performance with as few as 25 training samples or even fewer. We further extend our method to create the BDD100K-RAW and Cityscapes-RAW datasets, demonstrating its effectiveness for object detection on RAW imagery while significantly reducing the number of required RAW images.
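The conditioning mechanism described above can be sketched in a toy form: an RGB image is encoded into multi-scale guidance features, which are injected into each step of the reverse diffusion process via guided residual blocks. Everything below is an illustrative NumPy stand-in, not the paper's learned networks; the function names, the average-pooling "encoder", the additive injection, and the fixed step count are all placeholder assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def rgb_guidance_features(rgb, scales=(1, 2, 4)):
    """Placeholder for the RGB-guidance module: produce features at several
    resolutions (here simple average pooling; the paper uses a learned encoder)."""
    feats = {}
    for s in scales:
        h, w = rgb.shape[0] // s, rgb.shape[1] // s
        feats[s] = rgb[:h * s, :w * s].reshape(h, s, w, s, -1).mean(axis=(1, 3))
    return feats

def guided_residual_block(x, feat, weight=0.1):
    """Placeholder RGB-guided residual block: nudge the current estimate
    toward the guidance signal via an additive residual update."""
    return x + weight * (feat.mean(axis=-1, keepdims=True) - x)

def reverse_diffusion(rgb, steps=10):
    """Toy reverse process: start from Gaussian noise and iteratively refine a
    single-channel RAW-like estimate, conditioned on the RGB guidance features."""
    feats = rgb_guidance_features(rgb)
    x = rng.standard_normal(rgb.shape[:2] + (1,))
    for _ in range(steps):
        x = guided_residual_block(x, feats[1])  # full-resolution guidance only
    return x

rgb = rng.random((8, 8, 3))
raw = reverse_diffusion(rgb)
print(raw.shape)  # (8, 8, 1)
```

In the actual method, each resolution level of the denoising U-Net would consume the guidance features at the matching scale; the single-scale injection here only illustrates the data flow.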