SuSana Distancia is all you need: Enforcing class separability in metric learning via two novel distance-based loss functions for few-shot image classification

损失函数（机器学习） · 泛函 · 损失 · Learning · 小样本学习 ·

2023 年 5 月 17 日

翻译：SuSana Distancia 即你所需：通过两种新颖的基于距离的损失函数在度量学习中强化类可分性以用于小样本图像分类

Mauricio Mendez-Ruiza,Jorge Gonzalez-Zapatab,Ivan Reyes-Amezcuab,Daniel Flores-Araizaa,Francisco Lopez-Tiroa,Andres Mendez-Vazquezb,Gilberto Ochoa-Ruiz

from arxiv, Paper submitted to a jorunal for publication

Few-shot learning is a challenging area of research that aims to learn new concepts with only a few labeled samples of data. Recent works based on metric-learning approaches leverage the meta-learning approach, which is encompassed by episodic tasks that make use a support (training) and query set (test) with the objective of learning a similarity comparison metric between those sets. Due to the lack of data, the learning process of the embedding network becomes an important part of the few-shot task. Previous works have addressed this problem using metric learning approaches, but the properties of the underlying latent space and the separability of the difference classes on it was not entirely enforced. In this work, we propose two different loss functions which consider the importance of the embedding vectors by looking at the intra-class and inter-class distance between the few data. The first loss function is the Proto-Triplet Loss, which is based on the original triplet loss with the modifications needed to better work on few-shot scenarios. The second loss function, which we dub ICNN loss is based on an inter and intra class nearest neighbors score, which help us to assess the quality of embeddings obtained from the trained network. Our results, obtained from a extensive experimental setup show a significant improvement in accuracy in the miniImagenNet benchmark compared to other metric-based few-shot learning methods by a margin of 2%, demonstrating the capability of these loss functions to allow the network to generalize better to previously unseen classes. In our experiments, we demonstrate competitive generalization capabilities to other domains, such as the Caltech CUB, Dogs and Cars datasets compared with the state of the art.

翻译：小样本学习是一个具有挑战性的研究领域，旨在仅通过少量标注样本学习新概念。近年来基于度量学习的方法通常借鉴元学习策略，该策略通过利用支持集（训练）和查询集（测试）的片段式任务，学习这两个集合间的相似度比较度量。由于数据匮乏，嵌入网络的学习过程成为小样本任务的关键环节。先前的研究虽采用度量学习方法解决该问题，但未能充分强化潜在空间的特性及不同类别在该空间中的可分性。本文提出两种考虑嵌入向量重要性的损失函数，通过分析少量数据的类内距离与类间距离来优化学习。第一种损失函数为Proto-Triplet Loss，其基于原始三元组损失进行针对性改进，以更好地适配小样本场景。第二种损失函数（命名为ICNN损失）基于类内与类间最近邻得分，可有效评估训练网络生成的嵌入质量。通过大规模实验设置，我们的结果表明：在miniImageNet基准测试中，与其他基于度量的小样本学习方法相比，准确率显著提升2%，证明这些损失函数能使网络更好地泛化至未见类别。实验同时表明，与现有最先进方法相比，本方法在Caltech CUB、Dogs及Cars等其他领域数据集上展现出具有竞争力的泛化能力。

相关内容

损失函数（机器学习）

关注 10

损失函数，在AI中亦称呼距离函数，度量函数。此处的距离代表的是抽象性的，代表真实数据与预测数据之间的误差。损失函数（loss function）是用来估量你模型的预测值f(x)与真实值Y的不一致程度，它是一个非负实值函数,通常使用L(Y, f(x))来表示，损失函数越小，模型的鲁棒性就越好。损失函数是经验风险函数的核心部分，也是结构风险函数重要组成部分。

机器学习损失函数概述，Loss Functions in Machine Learning

专知会员服务

84+阅读 · 2022年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日