基于储层计算的预测路径积分控制：面向未知非线性动力学系统 (Reservoir Predictive Path Integral Control for Unknown Nonlinear Dynamics) - 专知论文

会员服务 ·

0

路径 · 系统 · 非线性动力 · 非线性动力学 · 动力学系统 ·

Reservoir Predictive Path Integral Control for Unknown Nonlinear Dynamics

翻译：基于储层计算的预测路径积分控制：面向未知非线性动力学系统

Daisuke Inoue,Tadayoshi Matsumori,Gouhei Tanaka,Yuji Ito

from arxiv, Submitted to IEEE for possible publication, 13 pages, 5 figures

Neural networks have found extensive application in data-driven control of nonlinear dynamical systems, yet fast online identification and control of unknown dynamics remain central challenges. To meet these challenges, this paper integrates echo-state networks (ESNs)--reservoir computing models implemented with recurrent neural networks--and model predictive path integral (MPPI) control--sampling-based variants of model predictive control. The proposed reservoir predictive path integral (RPPI) enables fast learning of nonlinear dynamics with ESNs and exploits the learned nonlinearities directly in MPPI control computation without linearization approximations. This framework is further extended to uncertainty-aware RPPI (URPPI), which achieves robust stochastic control by treating ESN output weights as random variables and minimizing an expected cost over their distribution to account for identification errors. Experiments on controlling a Duffing oscillator and a four-tank system demonstrate that URPPI improves control performance, reducing control costs by up to 60% compared to traditional quadratic programming-based model predictive control methods.

翻译：神经网络在非线性动力学系统的数据驱动控制中已得到广泛应用，然而对未知动力学系统进行快速在线辨识与控制仍是核心挑战。为应对这些挑战，本文融合了回声状态网络（一种基于循环神经网络实现的储层计算模型）与模型预测路径积分控制（一种基于采样的模型预测控制变体）。所提出的储层预测路径积分控制方法能够利用回声状态网络快速学习非线性动力学，并将学习到的非线性特性直接用于模型预测路径积分控制计算，无需进行线性化近似。该框架进一步扩展为不确定性感知的储层预测路径积分控制方法，该方法通过将回声状态网络的输出权重视为随机变量，并最小化其分布上的期望成本以补偿辨识误差，从而实现了鲁棒的随机控制。通过对杜芬振子和四水箱系统的控制实验表明，相较于传统的基于二次规划的模型预测控制方法，不确定性感知的储层预测路径积分控制方法将控制性能提升了最高达60%的控制成本降低。

0

相关内容

基于机器学习的交通流预测方法综述

基于机器学习的交通流预测方法综述

专知会员服务

35+阅读 · 2023年8月17日

多智能体系统带宽分配及预测云控制

多智能体系统带宽分配及预测云控制

专知会员服务

18+阅读 · 2023年7月9日

【CVPR2023】DynamicDet:目标检测的统一动态架构

【CVPR2023】DynamicDet:目标检测的统一动态架构

专知会员服务

26+阅读 · 2023年4月15日

【NeurIPS2022】解析动力学系统中物理信息图神经网络的性能

【NeurIPS2022】解析动力学系统中物理信息图神经网络的性能

专知会员服务

19+阅读 · 2022年11月12日

【CVPR 2022】基于可迁移GNN的自适应轨迹预测，Adaptive Trajectory Prediction via Transferable GNN

【CVPR 2022】基于可迁移GNN的自适应轨迹预测，Adaptive Trajectory Prediction via Transferable GNN

专知会员服务

47+阅读 · 2022年3月11日

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

专知会员服务

29+阅读 · 2022年3月2日

清华大学等首篇「动态神经网络」最新综述论文，20页pdf236篇文献

清华大学等首篇「动态神经网络」最新综述论文，20页pdf236篇文献

专知会员服务

80+阅读 · 2021年2月21日

《数据驱动的科学与工程——机器学习、动力系统与控制》，572页pdf

《数据驱动的科学与工程——机器学习、动力系统与控制》，572页pdf

专知会员服务

199+阅读 · 2021年2月17日

【清华大学】图神经网络交通流预测综述论文，19页pdf

【清华大学】图神经网络交通流预测综述论文，19页pdf

专知会员服务

50+阅读 · 2021年1月29日

解决非线性逆问题的新型深度神经网络，30页ppt，University of Helsinki

解决非线性逆问题的新型深度神经网络，30页ppt，University of Helsinki

专知会员服务

23+阅读 · 2020年4月29日

【干货书】《机器学习动力系统与控制》，572页pdf

【干货书】《机器学习动力系统与控制》，572页pdf

专知

36+阅读 · 2022年1月8日

最新《图卷积神经网络》中文综述论文，26页pdf，计算机学报-中科院计算所

最新《图卷积神经网络》中文综述论文，26页pdf，计算机学报-中科院计算所

专知

36+阅读 · 2020年5月19日

【WWW2020】结构深度聚类网络， Structural Deep Clustering Network，北京邮电大学

【WWW2020】结构深度聚类网络， Structural Deep Clustering Network，北京邮电大学

专知

31+阅读 · 2020年2月19日

【论文笔记】具有可微分池化的分层图表示学习

【论文笔记】具有可微分池化的分层图表示学习

专知

47+阅读 · 2019年11月11日

神经网络常微分方程 (Neural ODEs) 解析

神经网络常微分方程 (Neural ODEs) 解析

AI科技评论

42+阅读 · 2019年8月9日

ICLR 2019论文解读：深度学习应用于复杂系统控制

ICLR 2019论文解读：深度学习应用于复杂系统控制

机器之心

11+阅读 · 2019年1月10日

SFFAI报告 | 常建龙：深度卷积网络中的卷积算子研究进展

SFFAI报告 | 常建龙：深度卷积网络中的卷积算子研究进展

人工智能前沿讲习班

11+阅读 · 2018年10月22日

专栏 | 浅析图卷积神经网络

专栏 | 浅析图卷积神经网络

机器之心

28+阅读 · 2018年7月4日

【干货】基于TensorFlow卷积神经网络的短期股票预测

【干货】基于TensorFlow卷积神经网络的短期股票预测

专知

19+阅读 · 2017年12月15日

基于注意力机制的图卷积网络

基于注意力机制的图卷积网络

科技创新与创业

74+阅读 · 2017年11月8日

基于动态反馈的时滞非线性系统控制理论研究

国家自然科学基金

0+阅读 · 2017年12月31日

不确定分数阶非线性系统Mittag-Leffler自适应控制

国家自然科学基金

1+阅读 · 2016年12月31日

网络化非线性系统的协调控制及其在分布式可重构航天器中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

基于动态网络结构的膜计算系统及其算法研究

国家自然科学基金

1+阅读 · 2015年12月31日

面向估计性能优化的网络化控制系统传感器调度

国家自然科学基金

0+阅读 · 2015年12月31日

具有消极关系的耦合非线性系统同步与控制研究

国家自然科学基金

0+阅读 · 2015年12月31日

一类不确定非线性大系统的非光滑分散控制研究

国家自然科学基金

0+阅读 · 2015年12月31日

神经元网络系统的斑图动力学行为分析及控制

国家自然科学基金

0+阅读 · 2014年12月31日

反馈神经网络统一模型临界动力学研究及其在类脑计算机研制中的应用

国家自然科学基金

1+阅读 · 2014年12月31日

基于非线性动力学的复杂网络结构识别及其在力学系统中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

Graph Neural Model Predictive Control for High-Dimensional Systems

Graph Neural Model Predictive Control for High-Dimensional Systems

Arxiv

0+阅读 · 2月19日

Nonlinear Predictive Control of the Continuum and Hybrid Dynamics of a Suspended Deformable Cable for Aerial Pick and Place

Arxiv

0+阅读 · 2月19日

Nonplanar Model Predictive Control for Autonomous Vehicles with Recursive Sparse Gaussian Process Dynamics

Arxiv

0+阅读 · 2月18日

Drift-Diffusion Matching: Embedding dynamics in latent manifolds of asymmetric neural networks

Arxiv

0+阅读 · 2月16日

Reduced-order Control and Geometric Structure of Learned Lagrangian Latent Dynamics

Arxiv

0+阅读 · 2月9日

Sampling for Model Predictive Trajectory Planning in Autonomous Driving using Normalizing Flows

Arxiv

0+阅读 · 2月6日

Probabilistic function-on-function nonlinear autoregressive model for emulation and reliability analysis of dynamical systems

Arxiv

0+阅读 · 2月2日

Pathwise Learning of Stochastic Dynamical Systems with Partial Observations

Arxiv

0+阅读 · 1月29日

Domain-specific Hardware Acceleration for Model Predictive Path Integral Control

Arxiv

0+阅读 · 1月17日

Latent Dynamics Graph Convolutional Networks for model order reduction of parameterized time-dependent PDEs

Arxiv

0+阅读 · 1月16日

VIP会员

文章信息

相关主题

非线性动力

非线性动力学

动力学系统

相关VIP内容

基于机器学习的交通流预测方法综述

基于机器学习的交通流预测方法综述

专知会员服务

35+阅读 · 2023年8月17日

多智能体系统带宽分配及预测云控制

多智能体系统带宽分配及预测云控制

专知会员服务

18+阅读 · 2023年7月9日

【CVPR2023】DynamicDet:目标检测的统一动态架构

【CVPR2023】DynamicDet:目标检测的统一动态架构

专知会员服务

26+阅读 · 2023年4月15日

【NeurIPS2022】解析动力学系统中物理信息图神经网络的性能

【NeurIPS2022】解析动力学系统中物理信息图神经网络的性能

专知会员服务

19+阅读 · 2022年11月12日

【CVPR 2022】基于可迁移GNN的自适应轨迹预测，Adaptive Trajectory Prediction via Transferable GNN

【CVPR 2022】基于可迁移GNN的自适应轨迹预测，Adaptive Trajectory Prediction via Transferable GNN

专知会员服务

47+阅读 · 2022年3月11日

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

专知会员服务

29+阅读 · 2022年3月2日

清华大学等首篇「动态神经网络」最新综述论文，20页pdf236篇文献

清华大学等首篇「动态神经网络」最新综述论文，20页pdf236篇文献

专知会员服务

80+阅读 · 2021年2月21日

《数据驱动的科学与工程——机器学习、动力系统与控制》，572页pdf

《数据驱动的科学与工程——机器学习、动力系统与控制》，572页pdf

专知会员服务

199+阅读 · 2021年2月17日

【清华大学】图神经网络交通流预测综述论文，19页pdf

【清华大学】图神经网络交通流预测综述论文，19页pdf

专知会员服务

50+阅读 · 2021年1月29日

解决非线性逆问题的新型深度神经网络，30页ppt，University of Helsinki

解决非线性逆问题的新型深度神经网络，30页ppt，University of Helsinki

专知会员服务

23+阅读 · 2020年4月29日

热门VIP内容

开通专知VIP会员享更多权益服务

智能体记忆深度剖析：评价指标与系统局限性的分类体系及实证分析

《可信人工智能赋能系统的支柱》

【CMU博士论文】可靠轨迹预测的分层基石：数据、评估与方法

人工智能赋能边缘与自主系统：美陆军现代化进程聚焦威胁探测与战术边缘情报

相关资讯

【干货书】《机器学习动力系统与控制》，572页pdf

【干货书】《机器学习动力系统与控制》，572页pdf

专知

36+阅读 · 2022年1月8日

最新《图卷积神经网络》中文综述论文，26页pdf，计算机学报-中科院计算所

最新《图卷积神经网络》中文综述论文，26页pdf，计算机学报-中科院计算所

专知

36+阅读 · 2020年5月19日

【WWW2020】结构深度聚类网络， Structural Deep Clustering Network，北京邮电大学

【WWW2020】结构深度聚类网络， Structural Deep Clustering Network，北京邮电大学

专知

31+阅读 · 2020年2月19日

【论文笔记】具有可微分池化的分层图表示学习

【论文笔记】具有可微分池化的分层图表示学习

专知

47+阅读 · 2019年11月11日

神经网络常微分方程 (Neural ODEs) 解析

神经网络常微分方程 (Neural ODEs) 解析

AI科技评论

42+阅读 · 2019年8月9日

ICLR 2019论文解读：深度学习应用于复杂系统控制

ICLR 2019论文解读：深度学习应用于复杂系统控制

机器之心

11+阅读 · 2019年1月10日

SFFAI报告 | 常建龙：深度卷积网络中的卷积算子研究进展

SFFAI报告 | 常建龙：深度卷积网络中的卷积算子研究进展

人工智能前沿讲习班

11+阅读 · 2018年10月22日

专栏 | 浅析图卷积神经网络

专栏 | 浅析图卷积神经网络

机器之心

28+阅读 · 2018年7月4日

【干货】基于TensorFlow卷积神经网络的短期股票预测

【干货】基于TensorFlow卷积神经网络的短期股票预测

专知

19+阅读 · 2017年12月15日

基于注意力机制的图卷积网络

基于注意力机制的图卷积网络

科技创新与创业

74+阅读 · 2017年11月8日

相关论文

Graph Neural Model Predictive Control for High-Dimensional Systems

Graph Neural Model Predictive Control for High-Dimensional Systems

Arxiv

0+阅读 · 2月19日

Nonlinear Predictive Control of the Continuum and Hybrid Dynamics of a Suspended Deformable Cable for Aerial Pick and Place

Arxiv

0+阅读 · 2月19日

Nonplanar Model Predictive Control for Autonomous Vehicles with Recursive Sparse Gaussian Process Dynamics

Arxiv

0+阅读 · 2月18日

Drift-Diffusion Matching: Embedding dynamics in latent manifolds of asymmetric neural networks

Arxiv

0+阅读 · 2月16日

Reduced-order Control and Geometric Structure of Learned Lagrangian Latent Dynamics

Arxiv

0+阅读 · 2月9日

Sampling for Model Predictive Trajectory Planning in Autonomous Driving using Normalizing Flows

Arxiv

0+阅读 · 2月6日

Probabilistic function-on-function nonlinear autoregressive model for emulation and reliability analysis of dynamical systems

Arxiv

0+阅读 · 2月2日

Pathwise Learning of Stochastic Dynamical Systems with Partial Observations

Arxiv

0+阅读 · 1月29日

Domain-specific Hardware Acceleration for Model Predictive Path Integral Control

Arxiv

0+阅读 · 1月17日

Latent Dynamics Graph Convolutional Networks for model order reduction of parameterized time-dependent PDEs

Arxiv

0+阅读 · 1月16日

相关基金

基于动态反馈的时滞非线性系统控制理论研究

国家自然科学基金

0+阅读 · 2017年12月31日

不确定分数阶非线性系统Mittag-Leffler自适应控制

国家自然科学基金

1+阅读 · 2016年12月31日

网络化非线性系统的协调控制及其在分布式可重构航天器中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

基于动态网络结构的膜计算系统及其算法研究

国家自然科学基金

1+阅读 · 2015年12月31日

面向估计性能优化的网络化控制系统传感器调度

国家自然科学基金

0+阅读 · 2015年12月31日

具有消极关系的耦合非线性系统同步与控制研究

国家自然科学基金

0+阅读 · 2015年12月31日

一类不确定非线性大系统的非光滑分散控制研究

国家自然科学基金

0+阅读 · 2015年12月31日

神经元网络系统的斑图动力学行为分析及控制

国家自然科学基金

0+阅读 · 2014年12月31日

反馈神经网络统一模型临界动力学研究及其在类脑计算机研制中的应用

国家自然科学基金

1+阅读 · 2014年12月31日

基于非线性动力学的复杂网络结构识别及其在力学系统中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员