Optimal Derivative Feedback Control for an Active Magnetic Levitation System: An Experimental Study on Data-Driven Approaches


Saber Omidi, Rene Akupan Ebunle, Se Young Yoon

From arXiv; 10 pages, 9 figures. Preprint; manuscript under journal review.

This paper presents the design and implementation of data-driven optimal derivative feedback controllers for an active magnetic levitation system. A direct, model-free control design method based on the reinforcement learning framework is compared with an indirect optimal control design derived from a numerically identified mathematical model of the system. For the direct model-free approach, a policy iteration procedure is proposed that adds an iteration layer, called the epoch loop, to gather multiple sets of process data; this yields a more diverse dataset and helps reduce learning biases. The direct method is evaluated against a comparable optimal control solution designed from a plant model identified by combining Dynamic Mode Decomposition with Control (DMDc) and Prediction Error Minimization (PEM). Results show that both controllers stabilize the magnetic levitation system and improve its performance relative to controllers designed from a nominal model, and that the direct model-free approach consistently outperforms the indirect solution when multiple epochs are allowed. The iterative refinement of the optimal control law over the epoch loop gives the direct approach a clear advantage over the indirect method, which relies on a single set of system data to determine both the identified model and the control law.
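To make the indirect route mentioned in the abstract more concrete, the following is a minimal sketch of identifying a linear model from one batch of input/state data with a basic DMDc least-squares fit and then computing a feedback gain from the identified model. It is an illustration under stated assumptions, not the authors' procedure: the toy plant `A_true`, `B_true`, the weights `Q`, `R`, and the helper names `dmdc_fit` and `dlqr_gain` are all hypothetical; the PEM refinement step is omitted; and a standard discrete-time state-feedback LQR stands in for the paper's derivative feedback design.

```python
import numpy as np

def dmdc_fit(X, U):
    """Basic DMDc: least-squares fit of x[k+1] = A x[k] + B u[k].

    X has shape (n, T+1) (state snapshots), U has shape (m, T) (inputs).
    """
    X0, X1 = X[:, :-1], X[:, 1:]
    omega = np.vstack([X0, U])          # stacked state/input data
    G = X1 @ np.linalg.pinv(omega)      # G = [A B]
    n = X.shape[0]
    return G[:, :n], G[:, n:]

def dlqr_gain(A, B, Q, R, iters=2000):
    """Discrete-time LQR gain by iterating the Riccati difference equation."""
    P = Q.copy()
    for _ in range(iters):
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
    return K

# Hypothetical open-loop-unstable toy plant (NOT the system in the paper).
A_true = np.array([[1.0, 0.01],
                   [0.5, 1.0]])
B_true = np.array([[0.0],
                   [0.01]])

# Collect a single batch of data with a random excitation input.
rng = np.random.default_rng(0)
T = 50
U = rng.standard_normal((1, T))
X = np.zeros((2, T + 1))
X[:, 0] = [0.01, 0.0]
for k in range(T):
    X[:, k + 1] = A_true @ X[:, k] + B_true @ U[:, k]

# Identify the model, then design the gain from the identified model only.
A_hat, B_hat = dmdc_fit(X, U)
K = dlqr_gain(A_hat, B_hat, Q=np.eye(2), R=np.eye(1))

# Check the gain on the true plant: spectral radius of A - B K.
rho = max(abs(np.linalg.eigvals(A_true - B_true @ K)))
print("closed-loop spectral radius:", rho)
```

Because the data here are noise-free, the fit recovers the toy plant almost exactly; with measured data the identified model would carry errors from that single dataset, which is the limitation of the indirect method that the abstract points to.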

