Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification

The effectiveness of spectral-spatial feature learning is crucial for the hyperspectral image (HSI) classification task. Diffusion models, as a new class of groundbreaking generative models, have the ability to learn both contextual semantics and textual details from the distinct timestep dimension, enabling the modeling of complex spectral-spatial relations in HSIs. However, existing diffusion-based HSI classification methods only utilize manually selected single-timestep single-stage features, limiting the full exploration and exploitation of rich contextual semantics and textual information hidden in the diffusion model. To address this issue, we propose a novel diffusion-based feature learning framework that explores Multi-Timestep Multi-Stage Diffusion features for HSI classification for the first time, called MTMSD. Specifically, the diffusion model is first pretrained with unlabeled HSI patches to mine the connotation of unlabeled data, and then is used to extract the multi-timestep multi-stage diffusion features. To effectively and efficiently leverage multi-timestep multi-stage features,two strategies are further developed. One strategy is class & timestep-oriented multi-stage feature purification module with the inter-class and inter-timestep prior for reducing the redundancy of multi-stage features and alleviating memory constraints. The other one is selective timestep feature fusion module with the guidance of global features to adaptively select different timestep features for integrating texture and semantics. Both strategies facilitate the generality and adaptability of the MTMSD framework for diverse patterns of different HSI data. Extensive experiments are conducted on four public HSI datasets, and the results demonstrate that our method outperforms state-of-the-art methods for HSI classification, especially on the challenging Houston 2018 dataset.

翻译：光谱-空间特征学习的有效性对于高光谱图像（HSI）分类任务至关重要。扩散模型作为一类突破性的新型生成模型，能够从独特的时间步维度学习上下文语义与纹理细节，从而实现对HSI中复杂光谱-空间关系的建模。然而，现有基于扩散的HSI分类方法仅利用人工选择的单时间步单阶段特征，限制了对扩散模型中隐藏的丰富上下文语义与纹理信息的充分挖掘与利用。为解决此问题，我们首次提出一种探索多时间步多阶段扩散特征用于HSI分类的新型扩散特征学习框架，称为MTMSD。具体而言，首先使用无标签HSI图像块预训练扩散模型以挖掘无标签数据的内涵，随后利用该模型提取多时间步多阶段扩散特征。为高效且有效地利用多时间步多阶段特征，我们进一步提出两种策略：其一是面向类别与时间步的多阶段特征纯化模块，通过引入类间与时间步间先验来减少多阶段特征的冗余并缓解内存限制；其二是选择性时间步特征融合模块，在全局特征引导下自适应选择不同时间步特征以整合纹理与语义信息。两种策略共同提升了MTMSD框架对于不同HSI数据多样模式的泛化性与适应性。在四个公开HSI数据集上的大量实验表明，我们的方法在HSI分类任务上优于现有先进方法，尤其在具有挑战性的Houston 2018数据集上表现突出。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日