Fus-MAE: A cross-attention-based data fusion approach for Masked Autoencoders in remote sensing

Self-supervised frameworks for representation learning have recently attracted interest in the remote sensing community, given their potential to mitigate the high labeling costs associated with curating large satellite image datasets. In multimodal data fusion, the commonly used contrastive learning methods can help bridge the domain gap between different sensor types, but they rely on data augmentation techniques that require expertise and careful design, especially for multispectral remote sensing data. A possible but scarcely studied way to circumvent these limitations is to use a pretraining strategy based on masked image modelling. In this paper, we introduce Fus-MAE, a self-supervised learning framework based on masked autoencoders that uses cross-attention to perform early and feature-level data fusion between synthetic aperture radar (SAR) and multispectral optical data, two modalities with a significant domain gap. Our empirical findings demonstrate that Fus-MAE can effectively compete with contrastive learning strategies tailored for SAR-optical data fusion and outperforms other masked-autoencoder frameworks trained on a larger corpus.
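As a rough illustration of the two ingredients named in the abstract, the sketch below combines MAE-style random masking with a single cross-attention step in which visible optical tokens attend to visible SAR tokens. It is not the Fus-MAE implementation: the token count, embedding size, 75% mask ratio, random projection matrices, and every function name are assumptions chosen only to make the example self-contained.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply an (n x k) matrix by a (k x m) matrix (lists of lists)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


def random_matrix(rows, cols, rng):
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def keep_visible(tokens, mask_ratio, rng):
    """MAE-style random masking: keep a random subset of the tokens."""
    n_keep = max(1, round(len(tokens) * (1.0 - mask_ratio)))
    kept = sorted(rng.sample(range(len(tokens)), n_keep))
    return [tokens[i] for i in kept], kept


def cross_attention(query_tokens, context_tokens, w_q, w_k, w_v):
    """Let query_tokens attend to context_tokens.

    Returns (output, weights), where weights[i][j] is how strongly
    query token i attends to context token j.
    """
    q = matmul(query_tokens, w_q)
    k = matmul(context_tokens, w_k)
    v = matmul(context_tokens, w_v)
    scale = math.sqrt(len(q[0]))
    scores = matmul(q, transpose(k))
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v), weights


rng = random.Random(0)
dim = 8  # assumed embedding size

# Hypothetical patch embeddings; a real model would compute these from imagery.
sar_tokens = random_matrix(16, dim, rng)
optical_tokens = random_matrix(16, dim, rng)

# Mask each modality independently (75% is an assumed ratio).
sar_visible, sar_idx = keep_visible(sar_tokens, 0.75, rng)
opt_visible, opt_idx = keep_visible(optical_tokens, 0.75, rng)

# Random projections stand in for learned query/key/value weights.
w_q, w_k, w_v = (random_matrix(dim, dim, rng) for _ in range(3))

# Optical tokens query the SAR tokens, so each fused token mixes SAR values.
fused, attn = cross_attention(opt_visible, sar_visible, w_q, w_k, w_v)
```

Here the queries come from one modality and the keys and values from the other, which is what distinguishes cross-attention from self-attention. Swapping the first two arguments would let SAR tokens attend to optical tokens instead.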

Related content

Autoencoders

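An autoencoder is an artificial neural network used to learn efficient data encodings in an unsupervised manner. Its aim is to learn a representation (encoding) of a set of data, typically for dimensionality reduction, by training the network to ignore signal "noise". Alongside this reduction side, a reconstruction side is learned, in which the autoencoder tries to generate, from the reduced encoding, a representation as close as possible to its original input, hence its name. Several variants of the basic model exist, which aim to force the learned representations of the input to take on useful properties. Autoencoders are applied effectively to many problems, from facial recognition to acquiring the semantic meaning of words.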
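The encode-then-reconstruct idea can be sketched with the smallest possible case: a linear autoencoder with tied weights that compresses 2-D points to a single number and decodes them back. The synthetic data, learning rate, step count, and all names below are assumptions made for illustration, not part of any particular library or paper.

```python
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def reconstruct(w, x):
    """Encode x as the single number z = w.x, then decode it as z * w."""
    z = dot(w, x)
    return [z * wi for wi in w]


def mean_squared_error(w, data):
    """Average squared distance between each point and its reconstruction."""
    total = 0.0
    for x in data:
        x_hat = reconstruct(w, x)
        total += sum((xi - hi) ** 2 for xi, hi in zip(x, x_hat))
    return total / len(data)


def gradient(w, data):
    """Gradient of the mean squared reconstruction error with respect to w."""
    grad = [0.0] * len(w)
    for x in data:
        z = dot(w, x)
        residual = [xi - z * wi for xi, wi in zip(x, w)]
        rw = dot(residual, w)
        for i in range(len(w)):
            grad[i] += -2.0 * (z * residual[i] + rw * x[i])
    return [g / len(data) for g in grad]


rng = random.Random(0)
direction = [0.6, 0.8]  # the data lies near this line, plus small noise

data = []
for _ in range(200):
    t = rng.uniform(-1.0, 1.0)
    data.append([t * direction[0] + rng.gauss(0.0, 0.01),
                 t * direction[1] + rng.gauss(0.0, 0.01)])

w = [0.5, 0.1]  # arbitrary starting weights
initial_error = mean_squared_error(w, data)

# Plain gradient descent on the reconstruction error.
for _ in range(500):
    g = gradient(w, data)
    w = [wi - 0.2 * gi for wi, gi in zip(w, g)]

final_error = mean_squared_error(w, data)
```

Because the points lie close to a line, a one-number code is enough to reconstruct them almost exactly, and the learned weights end up aligned with that line while the noise component is discarded. Practical autoencoders use nonlinear layers and separate encoder and decoder weights.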