Automatic Combination of Sample Selection Strategies for Few-Shot Learning

In few-shot learning, such as meta-learning, few-shot fine-tuning or in-context learning, the limited number of samples used to train a model have a significant impact on the overall success. Although a large number of sample selection strategies exist, their impact on the performance of few-shot learning is not extensively known, as most of them have been so far evaluated in typical supervised settings only. In this paper, we thoroughly investigate the impact of 20 sample selection strategies on the performance of 5 few-shot learning approaches over 8 image and 6 text datasets. In addition, we propose a new method for automatic combination of sample selection strategies (ACSESS) that leverages the strengths and complementary information of the individual strategies. The experimental results show that our method consistently outperforms the individual selection strategies, as well as the recently proposed method for selecting support examples for in-context learning. We also show a strong modality, dataset and approach dependence for the majority of strategies as well as their dependence on the number of shots - demonstrating that the sample selection strategies play a significant role for lower number of shots, but regresses to random selection at higher number of shots.

翻译：在小样本学习（如元学习、小样本微调或上下文学习）中，用于训练模型的样本数量有限，这对整体成功率具有显著影响。尽管存在大量样本选择策略，但它们对小样本学习性能的影响尚未被广泛了解，因为到目前为止，大多数策略仅在典型的监督学习设置下进行了评估。本文深入研究了20种样本选择策略对5种小样本学习方法在8个图像数据集和6个文本数据集上的性能影响。此外，我们提出了一种新的自动组合样本选择策略方法（ACSESS），该方法利用了单个策略的优势和互补信息。实验结果表明，我们的方法在性能上始终优于单个选择策略以及最近提出的用于上下文学习支持样本选择的方法。我们还发现，大多数策略对模态、数据集和方法具有强烈的依赖性，并且这些策略依赖于样本数量——这表明样本选择策略在样本数量较少时发挥重要作用，但在样本数量较多时退化为随机选择。

相关内容

小样本学习

关注 216

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日