Distribution-free risk assessment of regression-based machine learning algorithms

Machine learning algorithms have grown in sophistication over the years and are increasingly deployed for real-life applications. However, when using machine learning techniques in practical settings, particularly in high-risk applications such as medicine and engineering, obtaining the failure probability of the predictive model is critical. We refer to this problem as the risk-assessment task. We focus on regression algorithms and the risk-assessment task of computing the probability of the true label lying inside an interval defined around the model's prediction. We solve the risk-assessment problem using the conformal prediction approach, which provides prediction intervals that are guaranteed to contain the true label with a given probability. Using this coverage property, we prove that our approximated failure probability is conservative in the sense that it is not lower than the true failure probability of the ML algorithm. We conduct extensive experiments to empirically study the accuracy of the proposed method for problems with and without covariate shift. Our analysis focuses on different modeling regimes, dataset sizes, and conformal prediction methodologies.
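To make the risk-assessment task concrete, the following is a minimal sketch of how a split conformal calibration set could yield a conservative failure probability for a fixed interval half-width. It is an illustration under stated assumptions, not the authors' exact procedure: the function name `conservative_failure_probability`, the use of absolute residuals as the conformity score, and the synthetic data are all hypothetical choices. The bound holds marginally and assumes the calibration and test points are exchangeable, so it does not cover the covariate-shift setting without further weighting.

```python
import numpy as np


def conservative_failure_probability(cal_residuals, half_width):
    """Upper-bound P(|Y - f(X)| > half_width) using split conformal calibration.

    cal_residuals: residuals y_i - f(x_i) on held-out calibration data
                   (assumed exchangeable with the test point).
    half_width:    radius of the interval placed around the prediction.
    """
    scores = np.abs(np.asarray(cal_residuals, dtype=float))
    n = scores.size
    # Number of calibration scores that fall inside the interval.
    k = int(np.sum(scores <= half_width))
    # The k-th smallest score is a conformal radius with coverage >= k / (n + 1),
    # and it does not exceed half_width, so the interval of radius half_width
    # covers at least as often. Hence the failure probability is <= 1 - k / (n + 1).
    return 1.0 - k / (n + 1)


if __name__ == "__main__":
    # Synthetic example (assumption): noisy linear data and a least-squares line.
    rng = np.random.default_rng(0)
    x = rng.uniform(0.0, 1.0, size=400)
    y = 2.0 * x + rng.normal(0.0, 0.1, size=400)

    # Fit on the first half, calibrate on the second half.
    slope, intercept = np.polyfit(x[:200], y[:200], deg=1)
    residuals = y[200:] - (slope * x[200:] + intercept)

    for eps in (0.05, 0.1, 0.2):
        bound = conservative_failure_probability(residuals, eps)
        print(f"half-width {eps}: failure probability <= {bound:.3f}")
```

The `n + 1` in the denominator is what makes the estimate conservative: even when every calibration residual lies inside the interval, the reported failure probability is `1 / (n + 1)` rather than zero.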

