HSIGene: A Foundation Model For Hyperspectral Image Generation

Hyperspectral image (HSI) plays a vital role in various fields such as agriculture and environmental monitoring. However, due to the expensive acquisition cost, the number of hyperspectral images is limited, degenerating the performance of downstream tasks. Although some recent studies have attempted to employ diffusion models to synthesize HSIs, they still struggle with the scarcity of HSIs, affecting the reliability and diversity of the generated images. Some studies propose to incorporate multi-modal data to enhance spatial diversity, but the spectral fidelity cannot be ensured. In addition, existing HSI synthesis models are typically uncontrollable or only support single-condition control, limiting their ability to generate accurate and reliable HSIs. To alleviate these issues, we propose HSIGene, a novel HSI generation foundation model which is based on latent diffusion and supports multi-condition control, allowing for more precise and reliable HSI generation. To enhance the spatial diversity of the training data while preserving spectral fidelity, we propose a new data augmentation method based on spatial super-resolution, in which HSIs are upscaled first, and thus abundant training patches could be obtained by cropping the high-resolution HSIs. In addition, to improve the perceptual quality of the augmented data, we introduce a novel two-stage HSI super-resolution framework, which first applies RGB bands super-resolution and then utilizes our proposed Rectangular Guided Attention Network (RGAN) for guided HSI super-resolution. Experiments demonstrate that the proposed model is capable of generating a vast quantity of realistic HSIs for downstream tasks such as denoising and super-resolution. The code and models are available at https://github.com/LiPang/HSIGene.

翻译：高光谱图像（HSI）在农业和环境监测等多个领域发挥着至关重要的作用。然而，由于采集成本高昂，高光谱图像的数量有限，导致下游任务的性能下降。尽管最近的一些研究尝试利用扩散模型合成HSI，但它们仍然受限于HSI的稀缺性，影响了生成图像的可靠性和多样性。一些研究提出结合多模态数据以增强空间多样性，但无法保证光谱保真度。此外，现有的HSI合成模型通常不可控或仅支持单条件控制，限制了其生成准确可靠HSI的能力。为缓解这些问题，我们提出了HSIGene，一种基于潜在扩散并支持多条件控制的新型HSI生成基础模型，能够实现更精确可靠的HSI生成。为了在保持光谱保真度的同时增强训练数据的空间多样性，我们提出了一种基于空间超分辨率的新数据增强方法，即首先对HSI进行上采样，从而通过裁剪高分辨率HSI获得丰富的训练图像块。此外，为提高增强数据的感知质量，我们引入了一种新颖的两阶段HSI超分辨率框架，该框架首先应用RGB波段超分辨率，然后利用我们提出的矩形引导注意力网络（RGAN）进行引导式HSI超分辨率。实验表明，所提模型能够为去噪和超分辨率等下游任务生成大量逼真的HSI。代码和模型可在 https://github.com/LiPang/HSIGene 获取。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日