Stochastic Nonsmooth Convex Optimization with Heavy-Tailed Noises - 专知论文

会员服务 ·

0

非光滑 · 噪声 · 最优 · 梯度 · 参数选择 ·

2023 年 3 月 25 日

Stochastic Nonsmooth Convex Optimization with Heavy-Tailed Noises

翻译：重尾噪声下的随机非光滑凸优化

Zijian Liu,Zhengyuan Zhou

Recently, several studies consider the stochastic optimization problem but in a heavy-tailed noise regime, i.e., the difference between the stochastic gradient and the true gradient is assumed to have a finite $p$-th moment (say being upper bounded by $\sigma^{p}$ for some $\sigma\geq0$) where $p\in(1,2]$, which not only generalizes the traditional finite variance assumption ($p=2$) but also has been observed in practice for several different tasks. Under this challenging assumption, lots of new progress has been made for either convex or nonconvex problems, however, most of which only consider smooth objectives. In contrast, people have not fully explored and well understood this problem when functions are nonsmooth. This paper aims to fill this crucial gap by providing a comprehensive analysis of stochastic nonsmooth convex optimization with heavy-tailed noises. We revisit a simple clipping-based algorithm, whereas, which is only proved to converge in expectation but under the additional strong convexity assumption. Under appropriate choices of parameters, for both convex and strongly convex functions, we not only establish the first high-probability rates but also give refined in-expectation bounds compared with existing works. Remarkably, all of our results are optimal (or nearly optimal up to logarithmic factors) with respect to the time horizon $T$ even when $T$ is unknown in advance. Additionally, we show how to make the algorithm parameter-free with respect to $\sigma$, in other words, the algorithm can still guarantee convergence without any prior knowledge of $\sigma$.

翻译：近年来，多项研究关注随机优化问题，但考虑的是重尾噪声场景，即随机梯度与真实梯度之差被假定具有有限的 $p$ 阶矩（例如，对于某个 $\sigma \geq 0$，上界为 $\sigma^{p}$），其中 $p \in (1,2]$。这不仅推广了传统的有限方差假设（$p=2$），且在多个实际任务中已被观测到。在此挑战性假设下，针对凸或非凸问题已取得诸多新进展，然而，其中大多数仅考虑光滑目标函数。相比之下，当函数非光滑时，该问题尚未被充分探索和深入理解。本文旨在通过全面分析重尾噪声下的随机非光滑凸优化来填补这一关键空白。我们重新审视一个基于剪切的简单算法，而该算法此前仅在额外强凸性假设下被证明在期望意义上收敛。在适当的参数选择下，对于凸函数和强凸函数，我们不仅首次建立了高概率收敛率，而且相比现有工作，给出了改进的期望界。值得注意的是，我们的所有结果在时间范围 $T$ 上均为最优（或达到接近最优的对数因子），即使 $T$ 事先未知。此外，我们展示了如何使算法在 $\sigma$ 方面实现免参数化，即算法无需任何关于 $\sigma$ 的先验知识即可保证收敛。

0

相关内容

非光滑

【2023新书】随机模型基础，815页pdf

【2023新书】随机模型基础，815页pdf

专知会员服务

105+阅读 · 2023年5月10日

【干货书】工程和科学中的概率和统计，

【干货书】工程和科学中的概率和统计，

专知会员服务

58+阅读 · 2022年12月24日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

南大《优化方法（Optimization Methods》课程，推荐！

南大《优化方法（Optimization Methods》课程，推荐！

专知会员服务

80+阅读 · 2022年4月3日

【经典书】凸优化：算法与复杂度，130页pdf

【经典书】凸优化：算法与复杂度，130页pdf

专知会员服务

81+阅读 · 2021年11月16日

【干货书】鲁棒优化Robust Optimization，570页pdf

专知会员服务

144+阅读 · 2021年3月17日

【博士论文】机器学习中部分非凸和随机优化算法研究

专知会员服务

75+阅读 · 2020年12月7日

【斯坦福】凸优化圣经- Convex Optimization （附730pdf下载）

【斯坦福】凸优化圣经- Convex Optimization （附730pdf下载）

专知会员服务

232+阅读 · 2020年6月5日

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

专知会员服务

103+阅读 · 2019年12月9日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

1+阅读 · 2022年6月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

分享神经网络中设计loss function的一些技巧

分享神经网络中设计loss function的一些技巧

极市平台

35+阅读 · 2019年1月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

博客 | 机器学习中的数学基础（凸优化）

博客 | 机器学习中的数学基础（凸优化）

AI研习社

14+阅读 · 2018年12月16日

AI/ML/DNN硬件加速设计怎么入门？

AI/ML/DNN硬件加速设计怎么入门？

StarryHeavensAbove

11+阅读 · 2018年12月4日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

基于惯性/嵌入式大气数据系统的火星进入段自主导航方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

Kahler 曲面中特殊曲面的研究

国家自然科学基金

0+阅读 · 2014年12月31日

神经网络随机学习算法的泛化性研究

国家自然科学基金

2+阅读 · 2013年12月31日

随机广义方程相对于概率分布的稳定性分析及应用

国家自然科学基金

1+阅读 · 2012年12月31日

递推局部多项式回归估计及其应用

国家自然科学基金

0+阅读 · 2012年12月31日

集值优化问题的定性分析和定量分析

国家自然科学基金

0+阅读 · 2012年12月31日

随机变分不等式

国家自然科学基金

0+阅读 · 2011年12月31日

模-相对Hochschild同调与上同调

国家自然科学基金

0+阅读 · 2011年12月31日

一类有限混合半参数时间序列模型的研究

国家自然科学基金

0+阅读 · 2009年12月31日

神经网络子空间学习算法的收敛性与鲁棒性

国家自然科学基金

1+阅读 · 2009年12月31日

Projection-Free Online Convex Optimization with Stochastic Constraints

Arxiv

0+阅读 · 2023年5月16日

Mixed Laplace approximation for marginal posterior and Bayesian inference in error-in-operator model

Arxiv

0+阅读 · 2023年5月16日

Convex optimization over a probability simplex

Arxiv

0+阅读 · 2023年5月15日

Bayesian inference for misspecified generative models

Arxiv

0+阅读 · 2023年5月15日

Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension

Arxiv

0+阅读 · 2023年5月15日

Accelerated Single-Call Methods for Constrained Min-Max Optimization

Arxiv

0+阅读 · 2023年5月14日

Federated TD Learning over Finite-Rate Erasure Channels: Linear Speedup under Markovian Sampling

Arxiv

0+阅读 · 2023年5月14日

Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance

Arxiv

0+阅读 · 2023年5月12日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction to Concepts and Methods

Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction to Concepts and Methods

Arxiv

15+阅读 · 2020年4月3日

VIP会员

文章信息

相关主题

最新内容

博士论文 | 面向大模型推理的内存高效算法

博士论文 | 面向大模型推理的内存高效算法

专知会员服务

2+阅读 · 7月27日

论文解读 | 从预训练到后训练：理解大模型推理能力如何形成

论文解读 | 从预训练到后训练：理解大模型推理能力如何形成

专知会员服务

3+阅读 · 7月27日

《无人系统互操作性导论——无人系统联合架构（JAUS）》

《无人系统互操作性导论——无人系统联合架构（JAUS）》

专知会员服务

11+阅读 · 7月27日

美空军新型反无人机部队初探

美空军新型反无人机部队初探

专知会员服务

7+阅读 · 7月27日

《对抗性电磁环境下远程巡飞弹作战的安全指挥与控制数据链》

《对抗性电磁环境下远程巡飞弹作战的安全指挥与控制数据链》

专知会员服务

6+阅读 · 7月27日

《北约下一代建模与仿真（NexGen M&S）计划》2026年69页

《北约下一代建模与仿真（NexGen M&S）计划》2026年69页

专知会员服务

4+阅读 · 7月27日

《防空交战流程的概率建模研究》

《防空交战流程的概率建模研究》

专知会员服务

10+阅读 · 7月27日

ICML 2026 教程 | 数值优化理论还重要吗？

ICML 2026 教程 | 数值优化理论还重要吗？

专知会员服务

6+阅读 · 7月26日

ICM 2026 | 陶哲轩：人工智能时代的数学

ICM 2026 | 陶哲轩：人工智能时代的数学

专知会员服务

9+阅读 · 7月26日

《面向可扩展高韧性无人机集群网络的速度感知分层通信框架》

《面向可扩展高韧性无人机集群网络的速度感知分层通信框架》

专知会员服务

8+阅读 · 7月26日

《面向概率推理的可定制战术引擎及其在军事任务规划中的应用》

《面向概率推理的可定制战术引擎及其在军事任务规划中的应用》

专知会员服务

11+阅读 · 7月26日

《先进防空系统选型战略框架：基于巴基斯坦的实证启示》

《先进防空系统选型战略框架：基于巴基斯坦的实证启示》

专知会员服务

8+阅读 · 7月26日

《反无人机交战场景下的战斗归零研究》

《反无人机交战场景下的战斗归零研究》

专知会员服务

7+阅读 · 7月26日

霍尔木兹与不对称作战时代：水雷、无人系统与海军力量的重新定义

霍尔木兹与不对称作战时代：水雷、无人系统与海军力量的重新定义

专知会员服务

4+阅读 · 7月26日

博士论文 | 用代码结构感知方法推进代码大模型

博士论文 | 用代码结构感知方法推进代码大模型

专知会员服务

6+阅读 · 7月25日

相关VIP内容

【2023新书】随机模型基础，815页pdf

【2023新书】随机模型基础，815页pdf

专知会员服务

105+阅读 · 2023年5月10日

【干货书】工程和科学中的概率和统计，

【干货书】工程和科学中的概率和统计，

专知会员服务

58+阅读 · 2022年12月24日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

南大《优化方法（Optimization Methods》课程，推荐！

南大《优化方法（Optimization Methods》课程，推荐！

专知会员服务

80+阅读 · 2022年4月3日

【经典书】凸优化：算法与复杂度，130页pdf

【经典书】凸优化：算法与复杂度，130页pdf

专知会员服务

81+阅读 · 2021年11月16日

【干货书】鲁棒优化Robust Optimization，570页pdf

专知会员服务

144+阅读 · 2021年3月17日

【博士论文】机器学习中部分非凸和随机优化算法研究

专知会员服务

75+阅读 · 2020年12月7日

【斯坦福】凸优化圣经- Convex Optimization （附730pdf下载）

【斯坦福】凸优化圣经- Convex Optimization （附730pdf下载）

专知会员服务

232+阅读 · 2020年6月5日

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

专知会员服务

103+阅读 · 2019年12月9日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

论文解读 | 从预训练到后训练：理解大模型推理能力如何形成

美空军新型反无人机部队初探

博士论文 | 面向大模型推理的内存高效算法

《无人系统互操作性导论——无人系统联合架构（JAUS）》

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

1+阅读 · 2022年6月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

分享神经网络中设计loss function的一些技巧

分享神经网络中设计loss function的一些技巧

极市平台

35+阅读 · 2019年1月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

博客 | 机器学习中的数学基础（凸优化）

博客 | 机器学习中的数学基础（凸优化）

AI研习社

14+阅读 · 2018年12月16日

AI/ML/DNN硬件加速设计怎么入门？

AI/ML/DNN硬件加速设计怎么入门？

StarryHeavensAbove

11+阅读 · 2018年12月4日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

相关论文

Projection-Free Online Convex Optimization with Stochastic Constraints

Arxiv

0+阅读 · 2023年5月16日

Mixed Laplace approximation for marginal posterior and Bayesian inference in error-in-operator model

Arxiv

0+阅读 · 2023年5月16日

Convex optimization over a probability simplex

Arxiv

0+阅读 · 2023年5月15日

Bayesian inference for misspecified generative models

Arxiv

0+阅读 · 2023年5月15日

Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension

Arxiv

0+阅读 · 2023年5月15日

Accelerated Single-Call Methods for Constrained Min-Max Optimization

Arxiv

0+阅读 · 2023年5月14日

Federated TD Learning over Finite-Rate Erasure Channels: Linear Speedup under Markovian Sampling

Arxiv

0+阅读 · 2023年5月14日

Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance

Arxiv

0+阅读 · 2023年5月12日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction to Concepts and Methods

Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction to Concepts and Methods

Arxiv

15+阅读 · 2020年4月3日

相关基金

基于惯性/嵌入式大气数据系统的火星进入段自主导航方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

Kahler 曲面中特殊曲面的研究

国家自然科学基金

0+阅读 · 2014年12月31日

神经网络随机学习算法的泛化性研究

国家自然科学基金

2+阅读 · 2013年12月31日

随机广义方程相对于概率分布的稳定性分析及应用

国家自然科学基金

1+阅读 · 2012年12月31日

递推局部多项式回归估计及其应用

国家自然科学基金

0+阅读 · 2012年12月31日

集值优化问题的定性分析和定量分析

国家自然科学基金

0+阅读 · 2012年12月31日

随机变分不等式

国家自然科学基金

0+阅读 · 2011年12月31日

模-相对Hochschild同调与上同调

国家自然科学基金

0+阅读 · 2011年12月31日

一类有限混合半参数时间序列模型的研究

国家自然科学基金

0+阅读 · 2009年12月31日

神经网络子空间学习算法的收敛性与鲁棒性

国家自然科学基金

1+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员