最佳化欺骗了我们,如何阻止它 (Hyperparameter Optimization Is Deceiving Us, and How to Stop It) - 专知论文

会员服务 ·

0

超参数 · 优化器 · Processing（编程语言） · 随机搜索 · 子空间 ·

2021 年 6 月 3 日

Hyperparameter Optimization Is Deceiving Us, and How to Stop It

翻译：最佳化欺骗了我们,如何阻止它

A. Feder Cooper,Yucheng Lu,Jessica Zosa Forde,Christopher De Sa

Recent empirical work shows that inconsistent results, based on choice of hyperparameter optimization (HPO) configuration, are a widespread problem in ML research. When comparing two algorithms J and K, searching one subspace can yield the conclusion that J outperforms K, whereas searching another can entail the opposite. In short, the way we choose hyperparameters can deceive us. We provide a theoretical complement to this prior work, arguing that, to avoid such deception, the process of drawing conclusions from HPO should be made more rigorous. We call this process epistemic hyperparameter optimization (EHPO), and put forth a logical framework to capture its semantics and how it can lead to inconsistent conclusions about performance. Our framework enables us to prove EHPO methods that are guaranteed to be defended against deception. We demonstrate its utility by proving and empirically validating a defended variant of random search.

翻译：最近的经验工作表明,基于选择超参数优化(HPO)配置的不一致结果在ML研究中是一个普遍的问题。在比较两种算法J和K时,搜索一个子空间可以得出J优于K的结论,而搜索另一个子空间则可能产生相反的结果。简而言之,我们选择超参数的方式可以欺骗我们。我们为先前的这项工作提供了理论补充,认为为了避免这种欺骗,应当使从HPO得出结论的过程更加严格。我们称之为超参数优化(EHPO),并提出了一个逻辑框架来捕捉它的语义和如何导致关于性能的不一致的结论。我们的框架使我们能够证明EHPO方法有保证不受欺骗。我们通过证明和实验性地验证随机搜索的防御变体来证明它的效用。

0

相关内容

超参数

在贝叶斯统计中，超参数是先验分布的参数；该术语用于将它们与所分析的基础系统的模型参数区分开。

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【干货书】Python程序员编程，810页pdf，Python® for Programmers

【干货书】Python程序员编程，810页pdf，Python® for Programmers

专知会员服务

62+阅读 · 2020年8月6日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【伯克利-Ke Li】学习优化，74页ppt，Learning to Optimize

【伯克利-Ke Li】学习优化，74页ppt，Learning to Optimize

专知会员服务

41+阅读 · 2020年7月23日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

已删除

将门创投

7+阅读 · 2018年4月25日

Understanding the Effects of Adversarial Personalized Ranking Optimization Method on Recommendation Quality

Understanding the Effects of Adversarial Personalized Ranking Optimization Method on Recommendation Quality

Arxiv

0+阅读 · 2021年7月29日

Bayesian Optimization for Min Max Optimization

Arxiv

0+阅读 · 2021年7月29日

How To Make the Gradients Small Stochastically: Even Faster Convex and Nonconvex SGD

Arxiv

0+阅读 · 2021年7月28日

A Tale Of Two Long Tails

Arxiv

0+阅读 · 2021年7月27日

Power Constrained Bandits

Arxiv

0+阅读 · 2021年7月27日

Tight Guarantees for Multi-unit Prophet Inequalities and Online Stochastic Knapsack

Arxiv

0+阅读 · 2021年7月23日

Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings

Arxiv

0+阅读 · 2021年7月23日

A model-based approach to assess epidemic risk

Arxiv

0+阅读 · 2021年7月22日

Hyperparameter Selection for Imitation Learning

Arxiv

7+阅读 · 2021年5月25日

Variance-based regularization with convex objectives

Arxiv

5+阅读 · 2017年12月14日

VIP会员

文章信息

相关主题

Processing（编程语言）

相关VIP内容

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【干货书】Python程序员编程，810页pdf，Python® for Programmers

【干货书】Python程序员编程，810页pdf，Python® for Programmers

专知会员服务

62+阅读 · 2020年8月6日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【伯克利-Ke Li】学习优化，74页ppt，Learning to Optimize

【伯克利-Ke Li】学习优化，74页ppt，Learning to Optimize

专知会员服务

41+阅读 · 2020年7月23日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

智能体记忆深度剖析：评价指标与系统局限性的分类体系及实证分析

《可信人工智能赋能系统的支柱》

【CMU博士论文】可靠轨迹预测的分层基石：数据、评估与方法

人工智能赋能边缘与自主系统：美陆军现代化进程聚焦威胁探测与战术边缘情报

相关资讯

已删除

将门创投

7+阅读 · 2018年4月25日

相关论文

Understanding the Effects of Adversarial Personalized Ranking Optimization Method on Recommendation Quality

Understanding the Effects of Adversarial Personalized Ranking Optimization Method on Recommendation Quality

Arxiv

0+阅读 · 2021年7月29日

Bayesian Optimization for Min Max Optimization

Arxiv

0+阅读 · 2021年7月29日

How To Make the Gradients Small Stochastically: Even Faster Convex and Nonconvex SGD

Arxiv

0+阅读 · 2021年7月28日

A Tale Of Two Long Tails

Arxiv

0+阅读 · 2021年7月27日

Power Constrained Bandits

Arxiv

0+阅读 · 2021年7月27日

Tight Guarantees for Multi-unit Prophet Inequalities and Online Stochastic Knapsack

Arxiv

0+阅读 · 2021年7月23日

Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings

Arxiv

0+阅读 · 2021年7月23日

A model-based approach to assess epidemic risk

Arxiv

0+阅读 · 2021年7月22日

Hyperparameter Selection for Imitation Learning

Arxiv

7+阅读 · 2021年5月25日

Variance-based regularization with convex objectives

Arxiv

5+阅读 · 2017年12月14日

微信扫码咨询专知VIP会员