SuSana Distancia is all you need: Enforcing class separability in metric learning via two novel distance-based loss functions for few-shot image classification

损失函数（机器学习） · 泛函 · 损失 · Learning · 小样本学习 ·

2023 年 5 月 15 日

翻译：SuSana Distancia 即是一切：通过两种新颖的基于距离的损失函数在度量学习中实现类别可分性以用于小样本图像分类

Mauricio Mendez-Ruiza,Jorge Gonzalez-Zapatab,Ivan Reyes-Amezcuab,Daniel Flores-Araizaa,Francisco Lopez-Tiroa,Andres Mendez-Vazquezb,Gilberto Ochoa-Ruiz

from arxiv, Paper submitted to a jorunal for publication

Few-shot learning is a challenging area of research that aims to learn new concepts with only a few labeled samples of data. Recent works based on metric-learning approaches leverage the meta-learning approach, which is encompassed by episodic tasks that make use a support (training) and query set (test) with the objective of learning a similarity comparison metric between those sets. Due to the lack of data, the learning process of the embedding network becomes an important part of the few-shot task. Previous works have addressed this problem using metric learning approaches, but the properties of the underlying latent space and the separability of the difference classes on it was not entirely enforced. In this work, we propose two different loss functions which consider the importance of the embedding vectors by looking at the intra-class and inter-class distance between the few data. The first loss function is the Proto-Triplet Loss, which is based on the original triplet loss with the modifications needed to better work on few-shot scenarios. The second loss function, which we dub ICNN loss is based on an inter and intra class nearest neighbors score, which help us to assess the quality of embeddings obtained from the trained network. Our results, obtained from a extensive experimental setup show a significant improvement in accuracy in the miniImagenNet benchmark compared to other metric-based few-shot learning methods by a margin of 2%, demonstrating the capability of these loss functions to allow the network to generalize better to previously unseen classes. In our experiments, we demonstrate competitive generalization capabilities to other domains, such as the Caltech CUB, Dogs and Cars datasets compared with the state of the art.

翻译：小样本学习是一个具有挑战性的研究领域，旨在仅利用少量标记数据样本学习新概念。近期基于度量学习的方法借鉴了元学习策略，该策略通过包含支持集（训练集）和查询集（测试集）的情节任务来学习两者间的相似性比较度量。由于数据匮乏，嵌入网络的学习过程成为小样本任务的关键环节。先前的研究采用度量学习方法解决此问题，但未能充分强化潜在特征空间的属性及不同类别在该空间中的可分性。本文提出了两种考虑嵌入向量重要性的损失函数，通过分析小样本数据的类内距离与类间距离实现优化。第一种损失函数为原型三元组损失（Proto-Triplet Loss），它在原始三元组损失基础上进行改进，使其更适用于小样本场景。第二种损失函数称为ICNN损失，基于类内与类间最近邻评分，用于评估训练网络所生成嵌入向量的质量。通过广泛的实验设置，我们的结果显示，与其他基于度量的小样本学习方法相比，该方法在miniImageNet基准测试中准确率提升了约2%，证明了这些损失函数能使网络对未见类别具有更强的泛化能力。实验结果表明，该方法在Caltech CUB、Dogs和Cars等数据集上相较于现有技术展现出具有竞争力的跨领域泛化能力。

相关内容

损失函数（机器学习）

关注 10

损失函数，在AI中亦称呼距离函数，度量函数。此处的距离代表的是抽象性的，代表真实数据与预测数据之间的误差。损失函数（loss function）是用来估量你模型的预测值f(x)与真实值Y的不一致程度，它是一个非负实值函数,通常使用L(Y, f(x))来表示，损失函数越小，模型的鲁棒性就越好。损失函数是经验风险函数的核心部分，也是结构风险函数重要组成部分。

机器学习损失函数概述，Loss Functions in Machine Learning

专知会员服务

84+阅读 · 2022年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日