
Convergence speed · Optimization problems · Precision · Noise · Cost

Improving CMA-ES Convergence Speed, Efficiency, and Reliability in Noisy Robot Optimization Problems


Russell M. Martin, Steven H. Collins

From arXiv. This is the authors' final accepted manuscript (post-peer-review, pre-publication). It was accepted for publication in Evolutionary Computation on 12 Jan 2026. For associated code, see https://github.com/RussellMMartin/AS-CMA-ES

Experimental robot optimization often requires evaluating each candidate policy for seconds to minutes. The chosen evaluation time influences optimization because of a speed-accuracy tradeoff: shorter evaluations enable faster iteration but are more susceptible to noise. Here, we introduce a supplement to the CMA-ES optimization algorithm, named Adaptive Sampling CMA-ES (AS-CMA), which assigns sampling time to candidates based on predicted sorting difficulty, aiming to achieve consistent precision. We compared AS-CMA to CMA-ES and Bayesian optimization using a range of static sampling times in four simulated cost landscapes. AS-CMA converged in 98% of all runs without adjustment to its tunable parameter, and converged 24-65% faster and with 29-76% lower total cost than each landscape's best CMA-ES static sampling time. Compared with Bayesian optimization, AS-CMA converged more efficiently and reliably in complex landscapes, while in simpler landscapes it was less efficient but equally reliable. We deployed AS-CMA in an exoskeleton optimization experiment and found that the optimizer's behavior was consistent with expectations. These results indicate that AS-CMA can improve optimization efficiency in the presence of noise while minimally affecting optimization setup complexity and tuning requirements.
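The speed-accuracy tradeoff described in the abstract can be illustrated with a small toy model. The sketch below is not the authors' AS-CMA implementation (see the linked repository for that). It assumes a Gaussian noise model whose standard deviation shrinks as 1/sqrt(sample time), and the names `noisy_cost`, `misrank_rate`, and `time_for_target_precision`, along with the 1 to 120 second bounds, are hypothetical choices made only for illustration.

```python
import math
import random


def noisy_cost(true_cost, sample_time, noise_scale=1.0, rng=random):
    """Return one noisy cost measurement.

    Assumed noise model (not from the paper): Gaussian noise whose
    standard deviation is noise_scale / sqrt(sample_time).
    """
    return true_cost + rng.gauss(0.0, noise_scale / math.sqrt(sample_time))


def misrank_rate(gap, sample_time, trials=2000, seed=0):
    """Estimate how often two candidates are sorted in the wrong order.

    The candidates have true costs 0 and `gap`; a mis-ranking occurs when
    the truly better candidate is measured as the worse one.
    """
    rng = random.Random(seed)
    wrong = 0
    for _ in range(trials):
        better = noisy_cost(0.0, sample_time, rng=rng)
        worse = noisy_cost(gap, sample_time, rng=rng)
        if better > worse:
            wrong += 1
    return wrong / trials


def time_for_target_precision(predicted_gap, noise_scale=1.0, z=1.0,
                              t_min=1.0, t_max=120.0):
    """Hypothetical allocation rule: sample long enough that the noise
    standard deviation is at most predicted_gap / z, clamped to
    [t_min, t_max] seconds. Smaller gaps (harder sorting) get more time.
    """
    if predicted_gap <= 0.0:
        return t_max
    needed = (z * noise_scale / predicted_gap) ** 2
    return min(t_max, max(t_min, needed))


if __name__ == "__main__":
    for t in (1.0, 4.0, 16.0):
        print(f"sample_time={t:5.1f}s  misrank_rate={misrank_rate(0.5, t):.3f}")
    for gap in (1.0, 0.3, 0.1):
        print(f"predicted_gap={gap:4.1f}  time={time_for_target_precision(gap):6.1f}s")
```

Under these assumptions, longer sampling lowers the mis-ranking rate for a fixed cost gap, and closely spaced candidates are assigned more sampling time than well-separated ones. That is the intuition behind allocating time by predicted sorting difficulty; the actual prediction and allocation scheme used by AS-CMA may differ.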

