HySpecNet-11k: A Large-Scale Hyperspectral Dataset for Benchmarking Learning-Based Hyperspectral Image Compression Methods

From arXiv. Accepted at the IEEE International Geoscience and Remote Sensing Symposium (IGARSS) 2023.

The development of learning-based hyperspectral image compression methods has recently attracted great attention in remote sensing. Such methods require a large number of hyperspectral images during training to optimize all parameters and reach high compression performance. However, existing hyperspectral datasets are not sufficient to train and evaluate learning-based compression methods, which hinders research in this field. To address this problem, in this paper we present HySpecNet-11k, a large-scale hyperspectral benchmark dataset made up of 11,483 non-overlapping image patches. Each patch covers 128 $\times$ 128 pixels with 224 spectral bands at a ground sample distance of 30 m. We use HySpecNet-11k to benchmark the current state of the art in learning-based hyperspectral image compression, focusing on various 1D, 2D and 3D convolutional autoencoder architectures. Beyond compression, HySpecNet-11k can be used for any unsupervised learning task in the framework of hyperspectral image analysis. The dataset, our code and the pre-trained weights are publicly available at https://hyspecnet.rsim.berlin.
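To give a sense of scale, the figures quoted in the abstract (11,483 patches, 128 $\times$ 128 pixels, 224 bands, 30 m ground sample distance) can be turned into a few back-of-the-envelope numbers. The sketch below is only illustrative arithmetic: the constant and function names are hypothetical, and the 2-bytes-per-value storage size is an assumption made for the example, not something stated in the abstract.

```python
# Back-of-the-envelope arithmetic from the figures quoted in the abstract.
# All names here are hypothetical; they are not part of the released code.

NUM_PATCHES = 11_483      # from the abstract
PATCH_SIDE_PX = 128       # from the abstract (patches are 128 x 128 pixels)
NUM_BANDS = 224           # from the abstract
GSD_METERS = 30           # from the abstract (ground sample distance)


def values_per_patch() -> int:
    """Number of spectral samples in one patch (height * width * bands)."""
    return PATCH_SIDE_PX * PATCH_SIDE_PX * NUM_BANDS


def values_in_dataset() -> int:
    """Number of spectral samples across all patches."""
    return NUM_PATCHES * values_per_patch()


def patch_side_meters() -> int:
    """Ground extent of one patch side, in meters."""
    return PATCH_SIDE_PX * GSD_METERS


def raw_size_bytes(bytes_per_value: int = 2) -> int:
    """Uncompressed size of the dataset.

    ASSUMPTION: bytes_per_value=2 (16-bit samples) is chosen for
    illustration only; the abstract does not state the storage format.
    """
    return values_in_dataset() * bytes_per_value


if __name__ == "__main__":
    print("values per patch:  ", values_per_patch())
    print("values in dataset: ", values_in_dataset())
    print("patch side (m):    ", patch_side_meters())
    print("raw size (GB, assumed 16-bit):", raw_size_bytes() / 1e9)
```

Under these assumptions each patch spans 3.84 km per side on the ground, and the uncompressed dataset would be on the order of tens of gigabytes, which helps explain why compression is a natural benchmark task for it.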

