Reinforcement Learning Compensated Model Predictive Control for Off-road Driving on Unknown Deformable Terrain


Prakhar Gupta,Jonathon M. Smereka,Yunyi Jia

From arXiv. Submitted to IEEE Transactions on Intelligent Vehicles as a regular paper; withdrawn in March 2025. A revised version of this manuscript was submitted to ACC 2025 for review as a regular paper in Sep 2025.

This study presents an Actor-Critic reinforcement learning Compensated Model Predictive Controller (AC2MPC) designed for high-speed, off-road autonomous driving on deformable terrains. To address the difficulty of modeling unknown tire-terrain interaction while ensuring real-time control feasibility and performance, the framework integrates deep reinforcement learning with a model predictive controller to manage unmodeled nonlinear dynamics. We evaluate the controller framework over constant and varying velocity profiles using the high-fidelity simulator Project Chrono. Our findings demonstrate that our controller statistically outperforms standalone model-based and learning-based controllers over three unknown terrains that represent a sandy deformable track, a sandy and rocky track, and a cohesive clay-like deformable soil track. Despite varied and previously unseen terrain characteristics, the framework generalized well enough to track longitudinal reference speeds with the least error. Furthermore, the framework required significantly less training data than a purely learning-based controller, converging in fewer steps while delivering better performance. Even when under-trained, the controller outperformed the standalone controllers, highlighting its potential for safer and more efficient real-world deployment.
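The compensation idea in the abstract can be illustrated with a toy longitudinal speed-tracking loop. This is only a sketch under stated assumptions, not the authors' AC2MPC: the abstract does not say how the learned term enters the control law, so additive compensation of the MPC output is assumed here. The plant, its terrain resistance, the constants, and the function names are all hypothetical, and a simple running estimate of the model's residual force stands in for the trained actor network (it is not reinforcement learning).

```python
import numpy as np

DT, MASS, U_MAX = 0.1, 1000.0, 5000.0  # step [s], mass [kg], force limit [N]; all assumed

def plant(v, u):
    """Toy 'true' vehicle: the nominal model plus a terrain resistance it does not know."""
    resistance = 800.0 + 30.0 * v  # hypothetical unmodeled terrain force [N]
    return v + DT * (u - resistance) / MASS

def mpc_force(v, v_ref, horizon=10, r=1e-6):
    """Unconstrained linear MPC on the nominal model v[k+1] = v[k] + DT*u[k]/MASS."""
    G = np.tril(np.full((horizon, horizon), DT / MASS))  # maps inputs to speed changes
    target = np.full(horizon, v_ref - v)                 # desired speed change at each step
    u = np.linalg.solve(G.T @ G + r * np.eye(horizon), G.T @ target)
    return u[0]  # receding horizon: apply only the first input

def run(compensate, steps=200, v_ref=10.0, alpha=0.2):
    """Simulate the loop and return the mean absolute speed error over the last 50 steps."""
    v, f_hat, errors = 0.0, 0.0, []
    for _ in range(steps):
        # Assumed structure: total command = MPC command + compensation term.
        u = mpc_force(v, v_ref) + (f_hat if compensate else 0.0)
        u = float(np.clip(u, -U_MAX, U_MAX))
        v_next = plant(v, u)
        # Residual force: what the nominal model predicted minus what actually happened.
        residual = (v + DT * u / MASS - v_next) * MASS / DT
        f_hat += alpha * (residual - f_hat)  # stand-in for the learned compensation
        v = v_next
        errors.append(abs(v_ref - v))
    return float(np.mean(errors[-50:]))

err_mpc_only = run(compensate=False)
err_compensated = run(compensate=True)
print(f"MPC only: {err_mpc_only:.3f} m/s, compensated: {err_compensated:.3f} m/s")
```

In this toy setting the uncompensated controller settles with a steady speed offset because its model ignores the terrain resistance, while the compensated loop drives that offset toward zero. A trained actor would replace the running estimate `f_hat`.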

