We present AnyThermal, a thermal backbone that captures robust, task-agnostic thermal features suitable for a variety of tasks such as cross-modal place recognition, thermal segmentation, and monocular depth estimation from thermal images. Existing thermal backbones rely on task-specific training on small-scale data, which limits their utility to a single environment and task. Unlike prior methods, AnyThermal can be used across a wide range of environments (indoor, aerial, off-road, urban) and tasks, all without task-specific training. Our key insight is to distill the feature representations of visual foundation models such as DINOv2 into a thermal encoder using thermal data drawn from these multiple environments. To bridge the diversity gap of existing RGB-Thermal datasets, we introduce the TartanRGBT platform, the first open-source data collection platform with synchronized RGB-Thermal image acquisition. We use this platform to collect the TartanRGBT dataset, a diverse and balanced dataset spanning 4 environments. We demonstrate the efficacy of AnyThermal and TartanRGBT, achieving state-of-the-art results with improvements of up to 36% across diverse environments and downstream tasks on existing datasets.
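The distillation objective described above aligns the thermal encoder's features with those of a frozen visual foundation model. The abstract does not specify the loss; a minimal dependency-free sketch of one plausible choice, a mean cosine-distance loss between per-patch student (thermal) and teacher (e.g. DINOv2) feature vectors, is shown below. The function name and the per-patch formulation are illustrative assumptions, not the paper's stated method.

```python
import math

def cosine_distill_loss(student_feats, teacher_feats):
    """Mean (1 - cosine similarity) between corresponding feature vectors.

    student_feats, teacher_feats: lists of equal-length feature vectors,
    e.g. per-patch embeddings from the thermal encoder and a frozen
    RGB foundation-model teacher (hypothetical formulation).
    """
    assert len(student_feats) == len(teacher_feats)
    total = 0.0
    for s, t in zip(student_feats, teacher_feats):
        dot = sum(a * b for a, b in zip(s, t))
        norm_s = math.sqrt(sum(a * a for a in s))
        norm_t = math.sqrt(sum(b * b for b in t))
        # 1 - cos(s, t): zero when the student matches the teacher's direction
        total += 1.0 - dot / (norm_s * norm_t)
    return total / len(student_feats)
```

Minimizing this loss over paired RGB-thermal data pushes the thermal encoder toward the teacher's task-agnostic representation space without any downstream labels.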