RS-Mamba for Large Remote Sensing Image Dense Prediction

The spatial resolution of remote sensing images continues to increase, which makes large very-high-resolution (VHR) remote sensing images difficult to handle in dense prediction tasks. Models based on convolutional neural networks are limited in their ability to model global features of remote sensing images because convolution operations are local. Transformer-based models, despite their global modeling capabilities, face computational challenges with large VHR images because of their quadratic complexity. The common practice of cropping large images into smaller patches leads to a significant loss of contextual information. To address these issues, we propose Remote Sensing Mamba (RSM) for dense prediction tasks in VHR remote sensing. RSM is designed to model global features of remote sensing images with linear complexity, enabling it to process large VHR images effectively. It employs an omnidirectional selective scan module to globally model the images in multiple directions, capturing large spatial features from various directions. Experiments on semantic segmentation and change detection tasks across various objects demonstrate the effectiveness of RSM. With a simple model architecture and training approach, RSM achieves state-of-the-art performance on the dense prediction tasks of VHR remote sensing. The code for this work will be available at https://github.com/walking-shadow/Official_Remote_Sensing_Mamba.
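To make the idea of scanning an image "in multiple directions" concrete, the following is a minimal sketch of how a 2D grid of feature positions could be flattened into several 1D sequences before a sequence model processes them. It is not the authors' implementation: the function name `scan_orders`, the choice of four base directions (horizontal, vertical, diagonal, anti-diagonal) plus their reverses, and the use of NumPy are all assumptions made for illustration. The abstract itself only states that the scan runs in multiple directions.

```python
import numpy as np


def scan_orders(height, width):
    """Return flattened index orders over a height x width grid.

    Hypothetical helper for illustration only. Each order is a permutation
    of range(height * width) describing the sequence in which grid
    positions would be visited by a 1D scan.
    """
    idx = np.arange(height * width).reshape(height, width)

    orders = {
        # Row by row, left to right.
        "horizontal": idx.reshape(-1),
        # Column by column, top to bottom.
        "vertical": idx.T.reshape(-1),
        # Diagonals running top-left to bottom-right,
        # starting from the bottom-left corner.
        "diagonal": np.concatenate(
            [idx.diagonal(k) for k in range(-(height - 1), width)]
        ),
        # Diagonals running top-right to bottom-left,
        # starting from the top-left corner.
        "anti_diagonal": np.concatenate(
            [np.fliplr(idx).diagonal(k) for k in range(width - 1, -height, -1)]
        ),
    }

    # Add the reverse of every base direction (assumed, for eight in total).
    for name in list(orders):
        orders[name + "_reverse"] = orders[name][::-1]
    return orders


if __name__ == "__main__":
    H, W, C = 2, 3, 4
    features = np.random.rand(H, W, C)      # toy feature map
    tokens = features.reshape(H * W, C)     # one token per grid position

    for name, order in scan_orders(H, W).items():
        sequence = tokens[order]            # tokens in scan order
        restored = sequence[np.argsort(order)]  # undo the permutation
        assert np.array_equal(restored, tokens)
        print(name, order.tolist())
```

Each order is only a reindexing of the same tokens, so the sequence length stays at H × W regardless of how many directions are used. Under this sketch, the cost of a linear-time scan would grow with the number of directions times H × W, whereas pairwise attention over the same tokens grows with (H × W)².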

