An Analysis of Loss Functions for Binary Classification and Regression

This paper explores connections between margin-based loss functions and consistency in binary classification and regression applications. It is shown that a large class of margin-based loss functions for binary classification/regression yield estimating scores equivalent to log-likelihood scores weighted by an even function. A simple characterization of conformable (consistent) loss functions is given, which allows for straightforward comparison of different losses, including exponential loss, logistic loss, and others. The characterization is used to construct a new Huber-type loss function for the logistic model. A simple relation between the margin and standardized logistic regression residuals is derived, demonstrating that all margin-based loss functions can be viewed as loss functions of squared standardized logistic regression residuals. The relation provides new, straightforward interpretations of exponential and logistic loss, and aids in understanding why exponential loss is sensitive to outliers. In particular, it is shown that minimizing empirical exponential loss is equivalent to minimizing the sum of squared standardized logistic regression residuals. The relation also provides new insight into the AdaBoost algorithm.
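The stated relation between the margin and the standardized residuals can be checked numerically. The sketch below is illustrative only and rests on assumptions not spelled out in the abstract: labels are coded y in {-1, +1}, the score f is the log-odds under the logistic model, and "standardized residual" is taken to mean the Pearson residual (y01 - p) / sqrt(p(1 - p)). The paper's own scaling may differ (for example, AdaBoost is often written with f equal to half the log-odds). All function names are hypothetical.

```python
import math

def sigmoid(f):
    """Logistic model probability P(y = +1) for a log-odds score f."""
    return 1.0 / (1.0 + math.exp(-f))

def squared_pearson_residual(y, f):
    """Squared Pearson residual for a label y in {-1, +1} and log-odds f.

    Assumption: the 'standardized residual' is the Pearson residual
    (y01 - p) / sqrt(p * (1 - p)), with y01 the 0/1 coding of y.
    """
    p = sigmoid(f)
    y01 = (y + 1) / 2
    return (y01 - p) ** 2 / (p * (1.0 - p))

def exponential_loss(y, f):
    """Exponential loss as a function of the margin y * f."""
    return math.exp(-y * f)

def logistic_loss(y, f):
    """Logistic loss as a function of the margin y * f."""
    return math.log1p(math.exp(-y * f))

# Compare the two quantities on a small grid of labels and scores.
grid = [(y, f) for y in (-1, 1) for f in (-3.0, -0.5, 0.0, 0.7, 2.5)]
max_gap = max(
    abs(squared_pearson_residual(y, f) - exponential_loss(y, f))
    for y, f in grid
)
print("largest |r^2 - exp(-y f)| on the grid:", max_gap)
```

Under these assumptions the squared residual equals exp(-y f), so exponential loss is the squared residual itself and logistic loss is log(1 + r^2). This makes the outlier sensitivity concrete: for a badly misclassified point, r^2 grows exponentially in the negative margin, while log(1 + r^2) grows only about linearly.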
