EFLNet: Enhancing Feature Learning for Infrared Small Target Detection

Single-frame infrared small target detection is considered to be a challenging task, due to the extreme imbalance between target and background, bounding box regression is extremely sensitive to infrared small target, and target information is easy to lose in the high-level semantic layer. In this article, we propose an enhancing feature learning network (EFLNet) to address these problems. First, we notice that there is an extremely imbalance between the target and the background in the infrared image, which makes the model pay more attention to the background features rather than target features. To address this problem, we propose a new adaptive threshold focal loss (ATFL) function that decouples the target and the background, and utilizes the adaptive mechanism to adjust the loss weight to force the model to allocate more attention to target features. Second, we introduce the normalized Gaussian Wasserstein distance (NWD) to alleviate the difficulty of convergence caused by the extreme sensitivity of the bounding box regression to infrared small target. Finally, we incorporate a dynamic head mechanism into the network to enable adaptive learning of the relative importance of each semantic layer. Experimental results demonstrate our method can achieve better performance in the detection performance of infrared small target compared to the state-of-the-art (SOTA) deep-learning-based methods. The source codes and bounding box annotated datasets are available at https://github.com/YangBo0411/infrared-small-target.

翻译：单帧红外小目标检测是一项具有挑战性的任务，这主要是由于目标与背景之间存在极度不平衡，边界框回归对红外小目标极为敏感，以及高层语义层中目标信息容易丢失。本文提出了一种增强特征学习网络（EFLNet）来解决这些问题。首先，我们发现红外图像中目标与背景之间存在极度不平衡，这使得模型更关注背景特征而非目标特征。为解决此问题，我们提出了一种新的自适应阈值焦点损失（ATFL）函数，该函数将目标与背景解耦，并利用自适应机制调整损失权重，迫使模型将更多注意力分配给目标特征。其次，我们引入归一化高斯瓦瑟斯坦距离（NWD），以缓解因边界框回归对红外小目标极度敏感而导致的收敛困难。最后，我们在网络中融入动态头部机制，使网络能够自适应学习每个语义层的相对重要性。实验结果表明，与当前最先进的（SOTA）基于深度学习方法相比，我们的方法在红外小目标检测性能上能取得更优效果。源代码及边界框标注数据集已公开于https://github.com/YangBo0411/infrared-small-target。

相关内容

表征学习

关注 152

在机器学习中，表征学习或表示学习是允许系统从原始数据中自动发现特征检测或分类所需的表示的一组技术。这取代了手动特征工程，并允许机器学习特征并使用它们执行特定任务。在有监督的表征学习中，使用标记的输入数据来学习特征，包括监督神经网络，多层感知器和（监督）字典学习。在无监督表征学习中，特征是与未标记的输入数据一起学习的，包括字典学习，独立成分分析，自动编码器，矩阵分解和各种形式的聚类。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日