Leaving the Nest: Going Beyond Local Loss Functions for Predict-Then-Optimize

Predict-then-Optimize is a framework for using machine learning to perform decision-making under uncertainty. The central research question it asks is, "How can the structure of a decision-making task be used to tailor ML models for that specific task?" To this end, recent work has proposed learning task-specific loss functions that capture this underlying structure. However, current approaches make restrictive assumptions about the form of these losses and their impact on ML model behavior. These assumptions lead to approaches with high computational cost and, when they are violated in practice, to poor performance. In this paper, we propose solutions to these issues, avoiding the aforementioned assumptions and utilizing the ML model's features to increase the sample efficiency of learning loss functions. We empirically show that our method achieves state-of-the-art results in four domains from the literature, often requiring an order of magnitude fewer samples than comparable methods from past work. Moreover, our approach outperforms the best existing method by nearly 200% when the localness assumption is broken.
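To make the motivation for task-specific losses concrete, the following is a minimal sketch of a predict-then-optimize pipeline. It is not the method proposed in the paper: the toy "pick the top-k items" decision task, the example values, and the function names (`solve`, `decision_quality`, `regret`, `mse`) are all assumptions chosen for illustration. It shows two predictions with the same mean squared error that lead to decisions of different quality, which is the kind of task structure a generic loss cannot see.

```python
import numpy as np

def solve(y_pred, k=1):
    """Hypothetical optimization step: select the k items with the highest predicted value."""
    z = np.zeros_like(y_pred, dtype=float)
    z[np.argsort(-y_pred, kind="stable")[:k]] = 1.0
    return z

def decision_quality(y_pred, y_true, k=1):
    """True value obtained when the decision is made using the predictions."""
    return float(solve(y_pred, k) @ y_true)

def regret(y_pred, y_true, k=1):
    """Gap between deciding with perfect information and deciding with the predictions."""
    return decision_quality(y_true, y_true, k) - decision_quality(y_pred, y_true, k)

def mse(y_pred, y_true):
    """Standard task-agnostic loss, shown for comparison."""
    return float(np.mean((y_pred - y_true) ** 2))

# Toy data (assumed): true item values and two imperfect predictions.
y_true = np.array([1.0, 0.9, 0.1])
pred_a = np.array([1.2, 0.9, 0.1])  # overestimates the best item
pred_b = np.array([0.8, 0.9, 0.1])  # underestimates the best item by the same amount

for name, pred in [("pred_a", pred_a), ("pred_b", pred_b)]:
    print(name, "mse:", mse(pred, y_true), "regret:", regret(pred, y_true))
```

Both predictions have the same MSE, but only `pred_b` changes which item is selected, so only `pred_b` incurs regret. A loss function tailored to the decision task would penalize the two errors differently.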


