CD-CTFM: A Lightweight CNN-Transformer Network for Remote Sensing Cloud Detection Fusing Multiscale Features

Clouds in remote sensing images inevitably affect information extraction, which hinder the following analysis of satellite images. Hence, cloud detection is a necessary preprocessing procedure. However, the existing methods have numerous calculations and parameters. In this letter, a lightweight CNN-Transformer network, CD-CTFM, is proposed to solve the problem. CD-CTFM is based on encoder-decoder architecture and incorporates the attention mechanism. In the decoder part, we utilize a lightweight network combing CNN and Transformer as backbone, which is conducive to extract local and global features simultaneously. Moreover, a lightweight feature pyramid module is designed to fuse multiscale features with contextual information. In the decoder part, we integrate a lightweight channel-spatial attention module into each skip connection between encoder and decoder, extracting low-level features while suppressing irrelevant information without introducing many parameters. Finally, the proposed model is evaluated on two cloud datasets, 38-Cloud and MODIS. The results demonstrate that CD-CTFM achieves comparable accuracy as the state-of-art methods. At the same time, CD-CTFM outperforms state-of-art methods in terms of efficiency.

翻译：遥感图像中的云层会不可避免地影响信息提取，从而阻碍卫星图像的后续分析。因此，云检测是一项必要的预处理步骤。然而，现有方法存在计算量与参数规模过大的问题。本文提出一种轻量级CNN-Transformer网络CD-CTFM来解决该问题。CD-CTFM基于编码器-解码器架构，并融合了注意力机制。在解码器部分，我们采用结合CNN与Transformer的轻量级网络作为主干网络，这有助于同时提取局部与全局特征。此外，我们设计了一个轻量级特征金字塔模块，用于融合包含上下文信息的多尺度特征。在解码器部分，我们将轻量级通道-空间注意力模块集成到编码器与解码器之间的每个跳跃连接中，在提取底层特征的同时抑制无关信息，且不引入过多参数。最后，在两个云数据集（38-Cloud与MODIS）上对提出的模型进行了评估。结果表明，CD-CTFM在精度上达到了与现有最优方法相当的水平，同时在效率上优于现有最优方法。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日