Optimizing for ROC Curves on Class-Imbalanced Data by Training over a Family of Loss Functions

Although binary classification is a well-studied problem in computer vision, training reliable classifiers under severe class imbalance remains challenging. Recent work has proposed techniques that mitigate the effects of training under imbalance by modifying the loss function or the optimization method. While this work has led to significant improvements in overall accuracy in the multi-class case, we observe that slight changes in the hyperparameter values of these methods can result in highly variable performance in terms of Receiver Operating Characteristic (ROC) curves on binary problems with severe imbalance. To reduce the sensitivity to hyperparameter choices and train more general models, we propose training over a family of loss functions instead of a single loss function. We develop a method for applying Loss Conditional Training (LCT) to an imbalanced classification problem. Extensive experimental results, on both CIFAR and Kaggle competition datasets, show that our method improves model performance and is more robust to hyperparameter choices. Code will be made available at: https://github.com/klieberman/roc_lct.
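To make the idea of training over a family of loss functions concrete, the following is a minimal sketch and not the method from the paper. The abstract does not specify the loss family or the conditioning mechanism, so everything here is an assumption chosen for illustration: the family is a class-weighted binary cross-entropy indexed by a weight lam on the positive class, the model is a logistic regression that receives lam as an extra input, the data is a toy one-dimensional imbalanced set, and all function names are hypothetical. During training a fresh lam is sampled at every step; at inference, sweeping lam moves the single trained model between operating points on the ROC plane.

```python
import math
import random

def make_data(n_neg=1000, n_pos=50, seed=0):
    """Toy 1-D imbalanced set (assumption): negatives ~ N(0, 1), positives ~ N(2, 1)."""
    rng = random.Random(seed)
    data = [(rng.gauss(0.0, 1.0), 0) for _ in range(n_neg)]
    data += [(rng.gauss(2.0, 1.0), 1) for _ in range(n_pos)]
    return data

def predict(w, x, lam):
    """Logistic model conditioned on lam through an extra, centered input feature."""
    z = w[0] + w[1] * x + w[2] * (lam - 0.5)
    return 1.0 / (1.0 + math.exp(-z))

def train_lct(data, steps=20000, lr=0.05, seed=1):
    """SGD in which every step samples one member of the loss family.

    Loss for one example (illustrative choice, not taken from the paper):
        L = -(lam * y * log p + (1 - lam) * (1 - y) * log(1 - p))
    """
    rng = random.Random(seed)
    w = [0.0, 0.0, 0.0]
    for _ in range(steps):
        x, y = rng.choice(data)
        lam = rng.uniform(0.1, 0.9)          # sample the loss hyperparameter
        p = predict(w, x, lam)               # the model sees the same lam
        # Gradient of the weighted loss with respect to the logit z.
        g = (1.0 - lam) * (1 - y) * p - lam * y * (1.0 - p)
        w[0] -= lr * g
        w[1] -= lr * g * x
        w[2] -= lr * g * (lam - 0.5)
    return w

def operating_point(w, data, lam, threshold=0.5):
    """Return (TPR, FPR) of the conditioned model at a fixed lam."""
    tp = fp = pos = neg = 0
    for x, y in data:
        pred = predict(w, x, lam) >= threshold
        if y == 1:
            pos += 1
            tp += pred
        else:
            neg += 1
            fp += pred
    return tp / pos, fp / neg

data = make_data()
w = train_lct(data)
for lam in (0.1, 0.5, 0.9):
    tpr, fpr = operating_point(w, data, lam)
    print(f"lam={lam:.1f}  TPR={tpr:.2f}  FPR={fpr:.2f}")
```

In this sketch one set of weights serves the whole family, so choosing lam becomes an inference-time decision rather than a training-time hyperparameter. A deep-network version would replace the extra input feature with whatever conditioning mechanism the full paper describes.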

