Curriculum Multi-Task Self-Supervision Improves Lightweight Architectures for Onboard Satellite Hyperspectral Image Segmentation

Hyperspectral imaging (HSI) captures detailed spectral signatures across hundreds of contiguous bands per pixel, being indispensable for remote sensing applications such as land-cover classification, change detection, and environmental monitoring. Due to the high dimensionality of HSI data and the slow rate of data transfer in satellite-based systems, compact and efficient models are required to support onboard processing and minimize the transmission of redundant or low-value data. To this end, we introduce a novel curriculum multi-task self-supervised learning (CMTSSL) framework designed for lightweight architectures for HSI analysis. CMTSSL integrates masked image modeling with decoupled spatial and spectral jigsaw puzzle solving, guided by a curriculum learning strategy that progressively increases data difficulty during self-supervision. This enables the encoder to jointly capture fine-grained spectral continuity, spatial structure, and global semantic features. Unlike prior dual-task SSL methods, CMTSSL simultaneously addresses spatial and spectral reasoning within a unified and computationally efficient design, being particularly suitable for training lightweight models for onboard satellite deployment. We validate our approach on four public benchmark datasets, demonstrating consistent gains in downstream segmentation tasks, using architectures that are over 16,000x lighter than some state-of-the-art models. These results highlight the potential of CMTSSL in generalizable representation learning with lightweight architectures for real-world HSI applications. Our code is publicly available at https://github.com/hugocarlesso/CMTSSL.

翻译：高光谱成像（HSI）通过每个像素数百个连续波段捕获详细的光谱特征，对于土地覆盖分类、变化检测和环境监测等遥感应用不可或缺。由于HSI数据的高维特性以及星载系统数据传输速率较低，需要紧凑高效的模型以支持星上处理并最小化冗余或低价值数据的传输。为此，我们提出了一种专为HSI分析轻量级架构设计的新型课程式多任务自监督学习（CMTSSL）框架。CMTSSL将掩码图像建模与解耦的空间及光谱拼图求解相结合，并通过课程学习策略引导，在自监督过程中逐步增加数据难度。这使得编码器能够同时捕获细粒度光谱连续性、空间结构和全局语义特征。与先前的双任务自监督学习方法不同，CMTSSL在统一且计算高效的设计中同时处理空间与光谱推理，特别适用于训练面向星载部署的轻量级模型。我们在四个公共基准数据集上验证了该方法，使用比某些先进模型轻16,000倍以上的架构，在下游分割任务中展现出持续的性能提升。这些结果凸显了CMTSSL在面向实际HSI应用的轻量级架构可泛化表征学习中的潜力。我们的代码公开于https://github.com/hugocarlesso/CMTSSL。