A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization - 专知论文

会员服务 ·

0

超参数 · Continuity · Lipschitz · 赌博机/老虎机 · 优化器 ·

2023 年 6 月 8 日

A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization

翻译：一种面向连续超参数优化的Lipschitz Bandit方法

Yasong Feng,Weijian Luo,Yimin Huang,Tianyu Wang

from arxiv, Some preliminaries and backgrounds are drawn from arXiv:2110.09722 by the first author and the last author, and their coauthor Z. Huang

One of the most critical problems in machine learning is HyperParameter Optimization (HPO), since choice of hyperparameters has a significant impact on final model performance. Although there are many HPO algorithms, they either have no theoretical guarantees or require strong assumptions. To this end, we introduce BLiE -- a Lipschitz-bandit-based algorithm for HPO that only assumes Lipschitz continuity of the objective function. BLiE exploits the landscape of the objective function to adaptively search over the hyperparameter space. Theoretically, we show that $(i)$ BLiE finds an $\epsilon$-optimal hyperparameter with $\mathcal{O} \left( \epsilon^{-(d_z + \beta)}\right)$ total budgets, where $d_z$ and $\beta$ are problem intrinsic; $(ii)$ BLiE is highly parallelizable. Empirically, we demonstrate that BLiE outperforms the state-of-the-art HPO algorithms on benchmark tasks. We also apply BLiE to search for noise schedule of diffusion models. Comparison with the default schedule shows that BLiE schedule greatly improves the sampling speed.

翻译：机器学习中最关键的问题之一是超参数优化（HPO），因为超参数的选择会对最终模型性能产生显著影响。尽管存在许多HPO算法，但它们要么缺乏理论保证，要么需要强假设条件。为此，我们提出BLiE——一种基于Lipschitz bandit的HPO算法，该算法仅假设目标函数满足Lipschitz连续性。BLiE利用目标函数的景观特征在超参数空间中进行自适应搜索。理论方面，我们证明：(i) BLiE在总预算为$\mathcal{O} \left( \epsilon^{-(d_z + \beta)}\right)$的条件下能够找到$\epsilon$-最优超参数，其中$d_z$和$\beta$是问题固有的参数；(ii) BLiE具有高度可并行性。实验方面，我们证明BLiE在基准任务上优于最先进的HPO算法。我们还将BLiE应用于扩散模型噪声调度的搜索，与默认调度相比，BLiE调度显著提升了采样速度。

0

相关内容

超参数

在贝叶斯统计中，超参数是先验分布的参数；该术语用于将它们与所分析的基础系统的模型参数区分开。

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

66+阅读 · 2023年2月15日

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

73+阅读 · 2022年7月11日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

南大《优化方法（Optimization Methods》课程，推荐！

南大《优化方法（Optimization Methods》课程，推荐！

专知会员服务

80+阅读 · 2022年4月3日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

247+阅读 · 2019年10月21日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

积分算子与函数方程的动力学研究

国家自然科学基金

0+阅读 · 2015年12月31日

碳酸盐“clumped”同位素的理论研究：到达平衡的反应时间和酸解过程的反应机理

国家自然科学基金

0+阅读 · 2014年12月31日

随机广义方程相对于概率分布的稳定性分析及应用

国家自然科学基金

1+阅读 · 2012年12月31日

高容量型纳米ZnMeFe2O4/C核壳结构材料的设计与嵌脱锂机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

Al2O3和TiOx在CaO-CaF2-SiO2渣系的热力学研究

国家自然科学基金

0+阅读 · 2011年12月31日

二阶非完整约束机械系统的动力学综合与控制

国家自然科学基金

0+阅读 · 2009年12月31日

泛函微分方程的多重概周期解和相关的分支问题

国家自然科学基金

0+阅读 · 2009年12月31日

广义Fermat猜想与相关的丢番图方程

国家自然科学基金

1+阅读 · 2009年12月31日

电热镦粗成形电热力多物理场耦合大变形机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

几何动力学在非完整系统几何数值积分中的应用研究

国家自然科学基金

0+阅读 · 2008年12月31日

Operator Splitting/Finite Element Methods for the Minkowski Problem

Arxiv

0+阅读 · 2023年8月1日

A Spectral Approach for the Dynamic Bradley-Terry Model

Arxiv

0+阅读 · 2023年7月31日

MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning

Arxiv

0+阅读 · 2023年7月31日

Optimal multi-environment causal regularization

Arxiv

0+阅读 · 2023年7月28日

Is this model reliable for everyone? Testing for strong calibration

Arxiv

0+阅读 · 2023年7月28日

A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

Arxiv

0+阅读 · 2023年7月27日

A Strategic Framework for Optimal Decisions in Football 1-vs-1 Shot-Taking Situations: An Integrated Approach of Machine Learning, Theory-Based Modeling, and Game Theory

Arxiv

0+阅读 · 2023年7月27日

Neural Networks for Scalar Input and Functional Output

Arxiv

0+阅读 · 2023年7月26日

On the Generalization Mystery in Deep Learning

Arxiv

10+阅读 · 2022年3月18日

Bayesian Deep Learning for Graphs

Arxiv

23+阅读 · 2022年2月24日

VIP会员

文章信息

相关主题

赌博机/老虎机

最新内容

《越野作战环境下路径规划的多准则整数规划模型》

《越野作战环境下路径规划的多准则整数规划模型》

专知会员服务

4+阅读 · 今天8:06

人工智能大语言模型引擎如何重塑全球冲突信息环境最新50页

人工智能大语言模型引擎如何重塑全球冲突信息环境最新50页

专知会员服务

3+阅读 · 今天8:00

《防空系统对自主武器系统辩论中“有意义的人类控制”的启示》70页报告

《防空系统对自主武器系统辩论中“有意义的人类控制”的启示》70页报告

专知会员服务

3+阅读 · 今天7:53

“对标ChatGPT”：乌军研发Marichka AI系统用于战场筹划

“对标ChatGPT”：乌军研发Marichka AI系统用于战场筹划

专知会员服务

6+阅读 · 今天7:49

《同步多无人机系统中的故障与通信》

《同步多无人机系统中的故障与通信》

专知会员服务

2+阅读 · 今天6:23

论文解读 | 医学图像修复中的扩散模型：挑战、分类与未来方向

论文解读 | 医学图像修复中的扩散模型：挑战、分类与未来方向

专知会员服务

2+阅读 · 7月28日

博士论文 | 从算法到基础模型：强化学习的统一视角

博士论文 | 从算法到基础模型：强化学习的统一视角

专知会员服务

7+阅读 · 7月28日

面向国防作战的最佳自主与蜂群无人机技术

面向国防作战的最佳自主与蜂群无人机技术

专知会员服务

7+阅读 · 7月28日

《异构人类团队的协作决策过程混合建模研究》

《异构人类团队的协作决策过程混合建模研究》

专知会员服务

8+阅读 · 7月28日

《C5ISR系统中的注意力动态与自适应决策支持研究：视觉与多模态注意力引导对任务绩效影响的递归量化分析》最新36页报告

《C5ISR系统中的注意力动态与自适应决策支持研究：视觉与多模态注意力引导对任务绩效影响的递归量化分析》最新36页报告

专知会员服务

8+阅读 · 7月28日

《设计思维中的人机协作：生成式人工智能对共情访谈影响的探究》140页

《设计思维中的人机协作：生成式人工智能对共情访谈影响的探究》140页

专知会员服务

9+阅读 · 7月28日

博士论文 | 面向大模型推理的内存高效算法

博士论文 | 面向大模型推理的内存高效算法

专知会员服务

5+阅读 · 7月27日

论文解读 | 从预训练到后训练：理解大模型推理能力如何形成

论文解读 | 从预训练到后训练：理解大模型推理能力如何形成

专知会员服务

10+阅读 · 7月27日

《无人系统互操作性导论——无人系统联合架构（JAUS）》

《无人系统互操作性导论——无人系统联合架构（JAUS）》

专知会员服务

14+阅读 · 7月27日

美空军新型反无人机部队初探

美空军新型反无人机部队初探

专知会员服务

10+阅读 · 7月27日

相关VIP内容

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

66+阅读 · 2023年2月15日

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

73+阅读 · 2022年7月11日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

南大《优化方法（Optimization Methods》课程，推荐！

南大《优化方法（Optimization Methods》课程，推荐！

专知会员服务

80+阅读 · 2022年4月3日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

247+阅读 · 2019年10月21日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能大语言模型引擎如何重塑全球冲突信息环境最新50页

“对标ChatGPT”：乌军研发Marichka AI系统用于战场筹划

《越野作战环境下路径规划的多准则整数规划模型》

《防空系统对自主武器系统辩论中“有意义的人类控制”的启示》70页报告

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

相关论文

Operator Splitting/Finite Element Methods for the Minkowski Problem

Arxiv

0+阅读 · 2023年8月1日

A Spectral Approach for the Dynamic Bradley-Terry Model

Arxiv

0+阅读 · 2023年7月31日

MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning

Arxiv

0+阅读 · 2023年7月31日

Optimal multi-environment causal regularization

Arxiv

0+阅读 · 2023年7月28日

Is this model reliable for everyone? Testing for strong calibration

Arxiv

0+阅读 · 2023年7月28日

A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

Arxiv

0+阅读 · 2023年7月27日

A Strategic Framework for Optimal Decisions in Football 1-vs-1 Shot-Taking Situations: An Integrated Approach of Machine Learning, Theory-Based Modeling, and Game Theory

Arxiv

0+阅读 · 2023年7月27日

Neural Networks for Scalar Input and Functional Output

Arxiv

0+阅读 · 2023年7月26日

On the Generalization Mystery in Deep Learning

Arxiv

10+阅读 · 2022年3月18日

Bayesian Deep Learning for Graphs

Arxiv

23+阅读 · 2022年2月24日

相关基金

积分算子与函数方程的动力学研究

国家自然科学基金

0+阅读 · 2015年12月31日

碳酸盐“clumped”同位素的理论研究：到达平衡的反应时间和酸解过程的反应机理

国家自然科学基金

0+阅读 · 2014年12月31日

随机广义方程相对于概率分布的稳定性分析及应用

国家自然科学基金

1+阅读 · 2012年12月31日

高容量型纳米ZnMeFe2O4/C核壳结构材料的设计与嵌脱锂机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

Al2O3和TiOx在CaO-CaF2-SiO2渣系的热力学研究

国家自然科学基金

0+阅读 · 2011年12月31日

二阶非完整约束机械系统的动力学综合与控制

国家自然科学基金

0+阅读 · 2009年12月31日

泛函微分方程的多重概周期解和相关的分支问题

国家自然科学基金

0+阅读 · 2009年12月31日

广义Fermat猜想与相关的丢番图方程

国家自然科学基金

1+阅读 · 2009年12月31日

电热镦粗成形电热力多物理场耦合大变形机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

几何动力学在非完整系统几何数值积分中的应用研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员