Few-Shot Object Detection via Variational Feature Aggregation

As few-shot object detectors are often trained with abundant base samples and fine-tuned on few-shot novel examples,the learned models are usually biased to base classes and sensitive to the variance of novel examples. To address this issue, we propose a meta-learning framework with two novel feature aggregation schemes. More precisely, we first present a Class-Agnostic Aggregation (CAA) method, where the query and support features can be aggregated regardless of their categories. The interactions between different classes encourage class-agnostic representations and reduce confusion between base and novel classes. Based on the CAA, we then propose a Variational Feature Aggregation (VFA) method, which encodes support examples into class-level support features for robust feature aggregation. We use a variational autoencoder to estimate class distributions and sample variational features from distributions that are more robust to the variance of support examples. Besides, we decouple classification and regression tasks so that VFA is performed on the classification branch without affecting object localization. Extensive experiments on PASCAL VOC and COCO demonstrate that our method significantly outperforms a strong baseline (up to 16\%) and previous state-of-the-art methods (4\% in average). Code will be available at: \url{https://github.com/csuhan/VFA}

翻译：由于小样本目标检测器通常使用大量基类样本进行训练，并在少量新类样本上进行微调，因此学习到的模型往往偏向于基类，且对新类样本的方差敏感。为解决这一问题，我们提出了一种包含两种新颖特征聚合方案的元学习框架。具体而言，我们首先提出了一种类别无关聚合（CAA）方法，其中查询特征和支持特征可以不受类别限制地进行聚合。不同类别之间的交互促进了类别无关表征，并减少了基类与新类之间的混淆。基于CAA，我们进一步提出了一种变分特征聚合（VFA）方法，该方法将支持样本编码为类别级支持特征，以实现稳健的特征聚合。我们利用变分自编码器来估计类别分布，并从分布中采样变分特征，这些特征对支持样本的方差更具鲁棒性。此外，我们将分类和回归任务解耦，使得VFA仅在分类分支上执行，而不会影响目标定位。在PASCAL VOC和COCO数据集上的大量实验表明，我们的方法显著优于强基线（最高提升16%）以及先前的最先进方法（平均提升4%）。代码将在以下地址公开：\url{https://github.com/csuhan/VFA}

相关内容

小样本学习

关注 216

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日