UDHF2-Net：基于不确定性扩散模型的高频Transformer网络用于遥感影像解译 (UDHF2-Net: Uncertainty-diffusion-model-based High-Frequency TransFormer Network for Remotely Sensed Imagery Interpretation)

Remotely sensed imagery interpretation (RSII) faces the three major problems: (1) objective representation of spatial distribution patterns; (2) edge uncertainty problem caused by downsampling encoder and intrinsic edge noises (e.g., mixed pixel and edge occlusion etc.); and (3) false detection problem caused by geometric registration error in change detection. To solve the aforementioned problems, uncertainty-diffusion-model-based high-Frequency TransFormer network (UDHF2-Net) is the first to be proposed, whose superiorities are as follows: (1) a spatially-stationary-and-non-stationary high-frequency connection paradigm (SHCP) is proposed to enhance the interaction of spatially frequency-wise stationary and non-stationary features to yield high-fidelity edge extraction result. Inspired by HRFormer, SHCP proposes high-frequency-wise stream to replace high-resolution-wise stream in HRFormer through the whole encoder-decoder process with parallel frequency-wise high-to-low streams, so it improves the edge extraction accuracy by continuously remaining high-frequency information; (2) a mask-and-geo-knowledge-based uncertainty diffusion module (MUDM), which is a self-supervised learning strategy, is proposed to improve the edge accuracy of extraction and change detection by gradually removing the simulated spectrum noises based on geo-knowledge and the generated diffused spectrum noises; (3) a frequency-wise semi-pseudo-Siamese UDHF2-Net is the first to be proposed to balance accuracy and complexity for change detection. Besides the aforementioned spectrum noises in semantic segmentation, MUDM is also a self-supervised learning strategy to effectively reduce the edge false change detection from the generated imagery with geometric registration error.

翻译：遥感影像解译面临三大主要问题：(1) 空间分布模式的客观表征；(2) 由下采样编码器及固有边缘噪声（如混合像元、边缘遮挡等）引起的边缘不确定性问题；(3) 变化检测中由几何配准误差导致的误检问题。为解决上述问题，本文首次提出了基于不确定性扩散模型的高频Transformer网络。其优势如下：(1) 提出了空间平稳与非平稳高频连接范式，通过增强空间频率维度上平稳与非平稳特征的交互，以产生高保真的边缘提取结果。受HRFormer启发，SHCP提出高频流，以取代HRFormer中的高分辨率流，贯穿整个具有并行频率维度高到低流的编码器-解码器过程，从而通过持续保留高频信息来提高边缘提取精度；(2) 提出了一种基于掩码与地理知识的无监督不确定性扩散模块，这是一种自监督学习策略，通过基于地理知识逐步去除模拟的光谱噪声及生成的扩散光谱噪声，以提高边缘提取和变化检测的精度；(3) 首次提出了频率维度半伪孪生UDHF2-Net，以在变化检测中平衡精度与复杂度。除语义分割中的上述光谱噪声外，MUDM也是一种自监督学习策略，能有效减少由存在几何配准误差的生成影像引起的边缘误变化检测。