Few-Shot Object Detection via Synthetic Features with Optimal Transport

Few-shot object detection aims to simultaneously localize and classify the objects in an image with limited training samples. However, most existing few-shot object detection methods focus on extracting the features of a few samples of novel classes that lack diversity. Hence, they may not be sufficient to capture the data distribution. To address that limitation, in this paper, we propose a novel approach in which we train a generator to generate synthetic data for novel classes. Still, directly training a generator on the novel class is not effective due to the lack of novel data. To overcome that issue, we leverage the large-scale dataset of base classes. Our overarching goal is to train a generator that captures the data variations of the base dataset. We then transform the captured variations into novel classes by generating synthetic data with the trained generator. To encourage the generator to capture data variations on base classes, we propose to train the generator with an optimal transport loss that minimizes the optimal transport distance between the distributions of real and synthetic data. Extensive experiments on two benchmark datasets demonstrate that the proposed method outperforms the state of the art. Source code will be available.

翻译：小样本目标检测旨在利用有限的训练样本同时定位和分类图像中的目标。然而，现有大多数小样本目标检测方法仅关注提取新颖类别少量样本的特征，由于这些样本缺乏多样性，可能不足以捕捉数据分布。为解决这一局限，本文提出了一种新方法，即训练生成器为新颖类别生成合成数据。但直接在新颖类别上训练生成器因缺乏新颖数据而效果不佳。为克服该问题，我们利用基类别的大规模数据集。总体目标是以捕捉基数据集数据变异性的方式训练生成器，并通过该生成器生成合成数据将捕获的变异性迁移至新颖类别。为促使生成器捕捉基类别的数据变异性，我们提出使用最优传输损失训练生成器，该损失最小化真实数据与合成数据分布之间的最优传输距离。在两个基准数据集上的大量实验表明，所提方法优于当前最优方法。源代码将公开。

相关内容

小样本学习

关注 216

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日