Finite-sample performance of the maximum likelihood estimator in logistic regression

Logistic regression is a classical model for describing the probabilistic dependence of binary responses on multivariate covariates. We study the predictive performance of the maximum likelihood estimator (MLE) for logistic regression, assessed in terms of logistic risk. We address two questions: first, the existence of the MLE (it exists exactly when the dataset is not linearly separated), and second, its accuracy when it exists. Both properties depend on the dimension of the covariates and on the signal strength. In the case of Gaussian covariates and a well-specified logistic model, we obtain sharp non-asymptotic guarantees for the existence and the excess logistic risk of the MLE. We then generalize these results in two ways: first, to non-Gaussian covariates satisfying a certain two-dimensional margin condition, and second, to the general case of statistical learning with a possibly misspecified logistic model. Finally, we consider the case of a Bernoulli design, where the behavior of the MLE is highly sensitive to the parameter direction.
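The link between linear separation and existence of the MLE can be illustrated with a small numerical sketch. This is not taken from the paper: the toy data, the helper names (`log_loss`, `empirical_risk`, `risk_along_ray`), and the choice of a model without intercept and with labels in {-1, +1} are all assumptions made for illustration. The sketch evaluates the empirical logistic risk along a ray `s * direction`. On the separated dataset the risk keeps decreasing toward 0 as `s` grows, so no finite minimizer exists; after adding one point that lies in the cone spanned by two positively labeled points but carries a negative label, the risk along the same ray eventually increases.

```python
import math


def log_loss(margin):
    """Logistic loss log(1 + exp(-margin)), computed in a numerically stable way."""
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def empirical_risk(theta, xs, ys):
    """Average logistic loss of theta on pairs (x_i, y_i) with y_i in {-1, +1}."""
    total = 0.0
    for x, y in zip(xs, ys):
        margin = y * sum(t * xi for t, xi in zip(theta, x))
        total += log_loss(margin)
    return total / len(xs)


def risk_along_ray(direction, xs, ys, scales):
    """Empirical risk at s * direction for each scale s."""
    return [empirical_risk([s * d for d in direction], xs, ys) for s in scales]


# Hypothetical toy data in dimension 2 (no intercept term).
separated_xs = [(1.0, 0.5), (2.0, -1.0), (-1.0, 0.2), (-1.5, -0.3)]
separated_ys = [1, 1, -1, -1]

# Add a negatively labeled point inside the cone of the two positive points:
# (1, 0) = 0.5 * (1, 0.5) + 0.25 * (2, -1), so no direction through the
# origin can separate the two classes any more.
overlapping_xs = separated_xs + [(1.0, 0.0)]
overlapping_ys = separated_ys + [-1]

direction = (1.0, 0.0)  # separates the first dataset
scales = [1.0, 10.0, 100.0]

separated_risks = risk_along_ray(direction, separated_xs, separated_ys, scales)
overlapping_risks = risk_along_ray(direction, overlapping_xs, overlapping_ys, scales)

print("separated:  ", separated_risks)    # decreasing toward 0
print("overlapping:", overlapping_risks)  # grows for large scales
```

The sketch only probes a single ray, so it illustrates the mechanism rather than testing separability in general; a full check would require searching over all directions, for example with a linear program.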


