Learning deep illumination-robust features from multispectral filter array images

Multispectral (MS) snapshot cameras equipped with an MS filter array (MSFA) capture multiple spectral bands in a single shot, resulting in a raw mosaic image where each pixel holds only one channel value. The fully-defined MS image is estimated from the raw one through $\textit{demosaicing}$, which inevitably introduces spatio-spectral artifacts. Moreover, training on fully-defined MS images can be computationally intensive, particularly with deep neural networks (DNNs), and may yield features that lack discriminative power because spatio-spectral interactions are learned suboptimally. Furthermore, outdoor MS images are acquired under varying lighting conditions, which makes the learned features illumination-dependent. This paper presents an original approach to learn discriminant and illumination-robust features directly from raw images. It involves: $\textit{raw spectral constancy}$ to mitigate the impact of illumination, $\textit{MSFA-preserving}$ transformations suited for raw image augmentation to train DNNs on diverse raw textures, and $\textit{raw-mixing}$ to capture discriminant spatio-spectral interactions in raw images. Experiments on MS image classification show that our approach outperforms both handcrafted and recent deep learning-based methods, while also requiring significantly less computational effort.
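To make the notion of a raw mosaic image concrete, the following is a minimal sketch that simulates one from a fully-defined MS image by keeping, at each pixel, only the band selected by a periodic filter pattern. It also shows one simple example of a transformation that keeps the filter pattern aligned: a crop whose offset is a multiple of the pattern period. The function names (`mosaic`, `period_aligned_crop`) and the 2x2 four-band pattern are illustrative assumptions; they are not taken from the paper, and the paper's own MSFA-preserving transformations may differ.

```python
import numpy as np


def mosaic(ms_image, pattern):
    """Simulate a raw MSFA image from a fully-defined MS image.

    ms_image: array of shape (H, W, K), one value per pixel and band.
    pattern:  integer array of shape (p, q) holding, for each position of
              the basic filter pattern, the index of the band it samples.
              (Hypothetical layout, chosen for illustration only.)
    Returns the raw image (H, W) and the per-pixel band map (H, W).
    """
    h, w, _ = ms_image.shape
    p, q = pattern.shape
    # Tile the basic pattern over the sensor, then trim to the image size.
    reps = (-(-h // p), -(-w // q))  # ceiling division
    band_map = np.tile(pattern, reps)[:h, :w]
    # Keep a single band value per pixel, as a snapshot camera would.
    rows, cols = np.indices((h, w))
    raw = ms_image[rows, cols, band_map]
    return raw, band_map


def period_aligned_crop(raw, pattern_shape, dy, dx, out_h, out_w):
    """Crop a raw image at an offset that is a multiple of the pattern period.

    Assumption for illustration: because the offset is (dy * p, dx * q),
    pixel (i, j) of the crop samples the same band as pixel (i, j) of the
    original raw image, so the filter layout is unchanged.
    """
    p, q = pattern_shape
    top, left = dy * p, dx * q
    return raw[top:top + out_h, left:left + out_w]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    pattern = np.array([[0, 1], [2, 3]])   # hypothetical 2x2, 4-band MSFA
    ms = rng.random((6, 8, 4))             # synthetic fully-defined image
    raw, band_map = mosaic(ms, pattern)
    crop = period_aligned_crop(raw, pattern.shape, 1, 1, 4, 4)
    print(raw.shape, crop.shape)
```

A crop at an arbitrary offset would instead shift the band layout, so a network trained on raw images would see a different filter arrangement than the one the camera produces.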

