FILM: How can Few-Shot Image Classification Benefit from Pre-Trained Language Models?

Few-shot learning aims to train models that generalize to novel classes from only a few samples. Recently, a line of work has been proposed to enhance few-shot learning with readily accessible semantic information from class names. However, these works focus on improving existing modules of the standard few-shot learning framework, such as visual prototypes and feature extractors, which limits how fully the semantic information can be exploited. In this paper, we propose a novel few-shot learning framework, based on contrastive learning, that uses pre-trained language models. To address the challenge of aligning visual features with textual embeddings obtained from a text-based pre-trained language model, we carefully design the textual branch of our framework and introduce a metric module that generalizes cosine similarity. For better transferability, we let the metric module adapt to different few-shot tasks and adopt MAML to train the model via bi-level optimization. Moreover, we conduct extensive experiments on multiple benchmarks to demonstrate the effectiveness of our method.
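To make the alignment idea concrete, here is a minimal NumPy sketch of scoring image features against class-name embeddings with cosine similarity and training with a contrastive (cross-entropy) loss. The abstract does not specify the form of the metric module, so the bilinear matrix `W` is purely an illustrative assumption, chosen because it reduces to cosine similarity when `W` is the identity. The function names, the temperature value, and the random stand-in features are hypothetical as well, and MAML's bi-level optimization is not shown.

```python
import numpy as np

def l2_normalize(x):
    # Scale each row to unit length so dot products become cosine similarities.
    return x / np.linalg.norm(x, axis=1, keepdims=True)

def cosine_logits(visual, text, temperature=0.1):
    # Plain cosine similarity between every image feature and every class embedding.
    return l2_normalize(visual) @ l2_normalize(text).T / temperature

def generalized_logits(visual, text, W, temperature=0.1):
    # ASSUMPTION: a bilinear form stands in for the paper's metric module.
    # With W equal to the identity this is exactly cosine_logits.
    return l2_normalize(visual) @ W @ l2_normalize(text).T / temperature

def contrastive_loss(logits, labels):
    # Cross-entropy over classes: pull each image toward its own class name
    # embedding and push it away from the other class names in the task.
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()

# Toy 5-way task; random vectors stand in for real encoder outputs.
rng = np.random.default_rng(0)
n_way, dim = 5, 16
text_emb = rng.normal(size=(n_way, dim))           # one embedding per class name
labels = np.array([0, 1, 2, 3, 4, 0, 1, 2, 3, 4])  # two images per class
visual_feat = text_emb[labels] + 0.1 * rng.normal(size=(len(labels), dim))

W = np.eye(dim)  # identity start: the module begins as plain cosine similarity
logits = generalized_logits(visual_feat, text_emb, W)
loss = contrastive_loss(logits, labels)
predictions = logits.argmax(axis=1)
```

In a real system `visual_feat` would come from an image encoder and `text_emb` from a language model applied to the class names, and `W` (or whatever the actual metric module is) would be adapted per task during training.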
