Compute Efficiency and Serial Runtime Tradeoffs for Stochastic Momentum Methods

Depen Morwani, Alexandru Meterez, Pranav Nair, Sham Kakade

Stochastic momentum methods such as heavy ball (HB), Nesterov momentum, and variants of Accelerated SGD (ASGD) [Kidambi et al., 2018] are widely used in modern training, but their stochastic benefits depend on two distinct quantities: serial runtime, the number of iterations needed to reach a target accuracy, and compute efficiency (CE), the inverse total gradient-query or FLOP cost. Larger batches reduce serial runtime without hurting CE only when the contraction gap grows linearly with batch size. We study stochastic HB and ASGD for consistent linear regression with Gaussian covariates and prove finite-dimensional, discrete-time lower bounds on their batch-size tradeoffs. Our first result shows that HB does not improve the CE frontier over SGD for arbitrary spectra; rather, it preserves SGD-level CE over a larger batch-size window, allowing larger batches to reduce serial runtime until HB reaches its deterministic accelerated scale. This window can be a factor $\sqrt{\kappa}$ larger than the SGD critical batch size. For ASGD, the picture is more spectrum-dependent: for rapidly decaying power-law spectra, ASGD improves small-batch CE over HB/SGD, but as batch size grows it trades this CE advantage for improved serial runtime. Synthetic linear-regression experiments verify these qualitative regimes, including near-overlap of ASGD and HB for slowly decaying spectra and the predicted CE-serial tradeoff for rapidly decaying spectra.
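The two quantities in the abstract can be measured directly in a small simulation. The sketch below is not the authors' experimental setup; it is a minimal illustration, assuming a diagonal power-law covariance, a noiseless (consistent) target, and the standard heavy-ball update. The helper names (`power_law_spectrum`, `iterations_to_target`, `tradeoff_table`) and all hyperparameter values are hypothetical choices made for this example. Serial runtime is taken as the number of iterations needed to shrink the population excess risk by a fixed factor, and the total gradient-query cost is that iteration count times the batch size, so CE is its inverse.

```python
import numpy as np


def power_law_spectrum(d, alpha):
    """Eigenvalues k^(-alpha), k = 1..d, of a diagonal covariance (assumed form)."""
    return np.arange(1, d + 1, dtype=float) ** (-alpha)


def iterations_to_target(batch_size, lr, momentum, eigs,
                         target=1e-3, max_iters=20000, seed=0):
    """Run mini-batch heavy ball on consistent linear regression.

    Returns the first iteration at which the population excess risk falls
    below target * (initial risk), or None if max_iters is exhausted.
    """
    rng = np.random.default_rng(seed)
    d = eigs.shape[0]
    w_star = np.ones(d) / np.sqrt(d)      # hypothetical ground-truth weights
    w = np.zeros(d)
    w_prev = w.copy()
    init_risk = 0.5 * np.sum(eigs * w_star ** 2)

    for t in range(1, max_iters + 1):
        # Gaussian covariates with covariance diag(eigs); labels are noiseless.
        X = rng.standard_normal((batch_size, d)) * np.sqrt(eigs)
        residual = X @ (w - w_star)
        grad = X.T @ residual / batch_size
        # Heavy-ball update: gradient step plus momentum on the last displacement.
        w_next = w - lr * grad + momentum * (w - w_prev)
        w_prev, w = w, w_next

        risk = 0.5 * np.sum(eigs * (w - w_star) ** 2)
        if risk <= target * init_risk:
            return t
    return None


def tradeoff_table(batch_sizes, lr, momentum, eigs):
    """Rows of (batch size, serial runtime, total gradient queries)."""
    rows = []
    for b in batch_sizes:
        t = iterations_to_target(b, lr, momentum, eigs)
        queries = None if t is None else t * b
        rows.append((b, t, queries))
    return rows


if __name__ == "__main__":
    eigs = power_law_spectrum(d=10, alpha=1.0)
    for b, t, q in tradeoff_table([1, 4, 16, 64], lr=0.05, momentum=0.5, eigs=eigs):
        print(f"batch={b:3d}  iterations={t}  gradient queries={q}")
```

Setting `momentum=0.0` gives plain mini-batch SGD, so the same table can be produced for both methods and compared across batch sizes. A single seed and a fixed step size give only a rough picture; a faithful comparison would tune the step size and momentum per batch size and average over seeds.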
