Adaptive formation motion planning and control of autonomous underwater vehicles using deep reinforcement learning - 专知论文

会员服务 ·

0

运动规划 · 控制策略 · 水下 · 自适应 · 深度强化学习 ·

2023 年 4 月 1 日

Adaptive formation motion planning and control of autonomous underwater vehicles using deep reinforcement learning

翻译：基于深度强化学习的自主水下航行器自适应编队运动规划与控制

Behnaz Hadi,Alireza Khosravi,Pouria Sarhadi

Creating safe paths in unknown and uncertain environments is a challenging aspect of leader-follower formation control. In this architecture, the leader moves toward the target by taking optimal actions, and followers should also avoid obstacles while maintaining their desired formation shape. Most of the studies in this field have inspected formation control and obstacle avoidance separately. The present study proposes a new approach based on deep reinforcement learning (DRL) for end-to-end motion planning and control of under-actuated autonomous underwater vehicles (AUVs). The aim is to design optimal adaptive distributed controllers based on actor-critic structure for AUVs formation motion planning. This is accomplished by controlling the speed and heading of AUVs. In obstacle avoidance, two approaches have been deployed. In the first approach, the goal is to design control policies for the leader and followers such that each learns its own collision-free path. Moreover, the followers adhere to an overall formation maintenance policy. In the second approach, the leader solely learns the control policy, and safely leads the whole group towards the target. Here, the control policy of the followers is to maintain the predetermined distance and angle. In the presence of ocean currents, communication delays, and sensing errors, the robustness of the proposed method under realistically perturbed circumstances is shown. The efficiency of the algorithms has been evaluated and approved using a number of computer-based simulations.

翻译：在未知和不确定环境中创建安全路径是领航-跟随编队控制中的一个挑战性方面。在该架构中，领航者通过采取最优动作向目标移动，而跟随者需在保持期望编队形状的同时避开障碍物。该领域的大多数研究分别考察了编队控制与障碍物规避问题。本研究提出了一种基于深度强化学习的新型端到端运动规划与控制方法，用于欠驱动自主水下航行器。目标在于基于执行器-评判器结构设计最优自适应分布式控制器，实现自主水下航行器编队运动规划，通过控制航行器的速度和航向完成该任务。在障碍物规避方面，采用了两种方法。第一种方法旨在为领航者和跟随者设计控制策略，使每个个体学习各自的避碰路径，同时跟随者需遵循整体编队保持策略。第二种方法中，仅领航者学习控制策略，并安全引导整个编队向目标移动，此时跟随者的控制策略是保持预设距离和角度。在海流、通信延迟和传感误差存在的情况下，验证了所提方法在现实扰动条件下的鲁棒性。通过一系列基于计算机仿真的实验，评估并确认了算法的有效性。

0

相关内容

运动规划

《物联网中协同机器人的自适应任务规划》295页博士论文，马德里理工大学

《物联网中协同机器人的自适应任务规划》295页博士论文，马德里理工大学

专知会员服务

78+阅读 · 2022年11月24日

《使用强化学习的无人作战飞行器机队协同规划》12页论文

《使用强化学习的无人作战飞行器机队协同规划》12页论文

专知会员服务

165+阅读 · 2022年11月14日

《使用模型预测控制和博弈论方法的移动机器人实时控制》140页博士论文

《使用模型预测控制和博弈论方法的移动机器人实时控制》140页博士论文

专知会员服务

56+阅读 · 2022年6月16日

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

专知会员服务

237+阅读 · 2022年4月10日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

24+阅读 · 2022年3月19日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

55+阅读 · 2021年4月11日

自动驾驶汽车的协调:分类和调查综述（Coordination of Autonomous Vehicles: Taxonomy and Survey），附31页pdf

自动驾驶汽车的协调:分类和调查综述（Coordination of Autonomous Vehicles: Taxonomy and Survey），附31页pdf

专知会员服务

14+阅读 · 2020年1月9日

深度学习在自动车辆控制中的应用研究综述（A Survey of Deep Learning Applications to Autonomous Vehicle Control）

深度学习在自动车辆控制中的应用研究综述（A Survey of Deep Learning Applications to Autonomous Vehicle Control）

专知会员服务

34+阅读 · 2019年12月25日

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

专知会员服务

13+阅读 · 2019年11月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

去中心化多智能体导航的基于模型的强化学习 (RL)

去中心化多智能体导航的基于模型的强化学习 (RL)

TensorFlow

13+阅读 · 2021年6月24日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【泡泡一分钟】学习行人如何导航：一种深度逆强化学习的方法

【泡泡一分钟】学习行人如何导航：一种深度逆强化学习的方法

泡泡机器人SLAM

20+阅读 · 2019年4月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

泡泡机器人SLAM

29+阅读 · 2018年10月28日

【泡泡一分钟】无人机与地面机器人协作三维建图（IROS-2）

【泡泡一分钟】无人机与地面机器人协作三维建图（IROS-2）

泡泡机器人SLAM

31+阅读 · 2018年1月8日

不确定与动态信息环境下基于预规划-重规划集成建模的应急物流选址-调度鲁棒优化研究

国家自然科学基金

3+阅读 · 2014年12月31日

基于自适应动态规划的非线性系统鲁棒控制与分散镇定

国家自然科学基金

3+阅读 · 2013年12月31日

非线性随机脉冲时滞系统的稳定性分析、控制与滤波及应用

国家自然科学基金

0+阅读 · 2012年12月31日

不连续耦合多智能体时滞网络化系统的协调控制

国家自然科学基金

0+阅读 · 2012年12月31日

基于回声状态网络的有限时间非线性系统自适应最优控制

国家自然科学基金

1+阅读 · 2012年12月31日

Multi-Agent架构智能机器人推理机实时性研究

国家自然科学基金

1+阅读 · 2011年12月31日

水下潜器三维空间路径规划方法与仿真研究

国家自然科学基金

5+阅读 · 2011年12月31日

随机切换系统在异步切换下的鲁棒控制与滤波

国家自然科学基金

0+阅读 · 2009年12月31日

基于局部仿射不变特征的移动机器人单目vSLAM研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于动态分层与自学习的多智能体自适应协作模型

国家自然科学基金

17+阅读 · 2008年12月31日

Reimagining Demand-Side Management with Mean Field Learning

Arxiv

0+阅读 · 2023年5月25日

Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments

Arxiv

0+阅读 · 2023年5月24日

KARNet: Kalman Filter Augmented Recurrent Neural Network for Learning World Models in Autonomous Driving Tasks

Arxiv

0+阅读 · 2023年5月24日

Option-Aware Adversarial Inverse Reinforcement Learning for Robotic Control

Arxiv

0+阅读 · 2023年5月23日

Learning Feasibility of Factored Nonlinear Programs in Robotic Manipulation Planning

Arxiv

0+阅读 · 2023年5月23日

Deep Reinforcement Learning-based Multi-objective Path Planning on the Off-road Terrain Environment for Ground Vehicles

Arxiv

0+阅读 · 2023年5月23日

Autonomous Drone Racing: A Survey

Arxiv

27+阅读 · 2023年1月5日

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Arxiv

33+阅读 · 2022年1月11日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

A Multi-Objective Deep Reinforcement Learning Framework

A Multi-Objective Deep Reinforcement Learning Framework

Arxiv

17+阅读 · 2018年6月27日

VIP会员

文章信息

相关主题

深度强化学习

最新内容

《无人机对海面作战影响评估》

《无人机对海面作战影响评估》

专知会员服务

9+阅读 · 7月21日

《可损耗无人系统规模化应用对美国军事转型的战略影响（2022-2030）》2026年270页

《可损耗无人系统规模化应用对美国军事转型的战略影响（2022-2030）》2026年270页

专知会员服务

8+阅读 · 7月21日

博士论文 | 后训练如何损害大模型生成多样性？SimpleStrat与Stylus

博士论文 | 后训练如何损害大模型生成多样性？SimpleStrat与Stylus

专知会员服务

3+阅读 · 7月21日

综述 | 面向5G/6G网络的LLM智能体AI：架构、协议与标准化

综述 | 面向5G/6G网络的LLM智能体AI：架构、协议与标准化

专知会员服务

5+阅读 · 7月21日

五角大楼新设无人机办公室（DRPM-UxS）将如何重塑美国无人系统格局（附美国防部设立备忘录）

五角大楼新设无人机办公室（DRPM-UxS）将如何重塑美国无人系统格局（附美国防部设立备忘录）

专知会员服务

6+阅读 · 7月21日

印度精确打击与指挥架构的断层

印度精确打击与指挥架构的断层

专知会员服务

5+阅读 · 7月20日

《NASA喷气推进实验室：高耐久轻质常驻空观测系统（HELIOS）》429页

《NASA喷气推进实验室：高耐久轻质常驻空观测系统（HELIOS）》429页

专知会员服务

7+阅读 · 7月20日

美空军AI完成F-16战斗机自主空战历史性试飞

美空军AI完成F-16战斗机自主空战历史性试飞

专知会员服务

6+阅读 · 7月20日

《美政府问责局——武器系统年度评估（2026年）：强制要求成熟技术或可推动转向快速交付》249页

《美政府问责局——武器系统年度评估（2026年）：强制要求成熟技术或可推动转向快速交付》249页

专知会员服务

8+阅读 · 7月20日

《美国陆军：通过弹性分布式模型库实现自适应AI优势》

《美国陆军：通过弹性分布式模型库实现自适应AI优势》

专知会员服务

7+阅读 · 7月20日

博士论文 | 理解与改进大语言模型推理：从反转诅咒到连续思维链

博士论文 | 理解与改进大语言模型推理：从反转诅咒到连续思维链

专知会员服务

9+阅读 · 7月20日

综述 | 终身视觉表征：持续自监督学习CSSL系统综述

综述 | 终身视觉表征：持续自监督学习CSSL系统综述

专知会员服务

8+阅读 · 7月20日

深入Project Maven：为何人工智能在战场上依然失灵

深入Project Maven：为何人工智能在战场上依然失灵

专知会员服务

15+阅读 · 7月19日

锻造未来士兵：外骨骼、基因工程与赛博格

锻造未来士兵：外骨骼、基因工程与赛博格

专知会员服务

8+阅读 · 7月19日

《无人机系统（UAS）通信网状网络试验性部署》50页报告

《无人机系统（UAS）通信网状网络试验性部署》50页报告

专知会员服务

10+阅读 · 7月19日

相关VIP内容

《物联网中协同机器人的自适应任务规划》295页博士论文，马德里理工大学

《物联网中协同机器人的自适应任务规划》295页博士论文，马德里理工大学

专知会员服务

78+阅读 · 2022年11月24日

《使用强化学习的无人作战飞行器机队协同规划》12页论文

《使用强化学习的无人作战飞行器机队协同规划》12页论文

专知会员服务

165+阅读 · 2022年11月14日

《使用模型预测控制和博弈论方法的移动机器人实时控制》140页博士论文

《使用模型预测控制和博弈论方法的移动机器人实时控制》140页博士论文

专知会员服务

56+阅读 · 2022年6月16日

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

专知会员服务

237+阅读 · 2022年4月10日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

24+阅读 · 2022年3月19日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

55+阅读 · 2021年4月11日

自动驾驶汽车的协调:分类和调查综述（Coordination of Autonomous Vehicles: Taxonomy and Survey），附31页pdf

自动驾驶汽车的协调:分类和调查综述（Coordination of Autonomous Vehicles: Taxonomy and Survey），附31页pdf

专知会员服务

14+阅读 · 2020年1月9日

深度学习在自动车辆控制中的应用研究综述（A Survey of Deep Learning Applications to Autonomous Vehicle Control）

深度学习在自动车辆控制中的应用研究综述（A Survey of Deep Learning Applications to Autonomous Vehicle Control）

专知会员服务

34+阅读 · 2019年12月25日

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

专知会员服务

13+阅读 · 2019年11月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《可损耗无人系统规模化应用对美国军事转型的战略影响（2022-2030）》2026年270页

综述 | 面向5G/6G网络的LLM智能体AI：架构、协议与标准化

《无人机对海面作战影响评估》

博士论文 | 后训练如何损害大模型生成多样性？SimpleStrat与Stylus

相关资讯

去中心化多智能体导航的基于模型的强化学习 (RL)

去中心化多智能体导航的基于模型的强化学习 (RL)

TensorFlow

13+阅读 · 2021年6月24日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【泡泡一分钟】学习行人如何导航：一种深度逆强化学习的方法

【泡泡一分钟】学习行人如何导航：一种深度逆强化学习的方法

泡泡机器人SLAM

20+阅读 · 2019年4月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

泡泡机器人SLAM

29+阅读 · 2018年10月28日

【泡泡一分钟】无人机与地面机器人协作三维建图（IROS-2）

【泡泡一分钟】无人机与地面机器人协作三维建图（IROS-2）

泡泡机器人SLAM

31+阅读 · 2018年1月8日

相关论文

Reimagining Demand-Side Management with Mean Field Learning

Arxiv

0+阅读 · 2023年5月25日

Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments

Arxiv

0+阅读 · 2023年5月24日

KARNet: Kalman Filter Augmented Recurrent Neural Network for Learning World Models in Autonomous Driving Tasks

Arxiv

0+阅读 · 2023年5月24日

Option-Aware Adversarial Inverse Reinforcement Learning for Robotic Control

Arxiv

0+阅读 · 2023年5月23日

Learning Feasibility of Factored Nonlinear Programs in Robotic Manipulation Planning

Arxiv

0+阅读 · 2023年5月23日

Deep Reinforcement Learning-based Multi-objective Path Planning on the Off-road Terrain Environment for Ground Vehicles

Arxiv

0+阅读 · 2023年5月23日

Autonomous Drone Racing: A Survey

Arxiv

27+阅读 · 2023年1月5日

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Arxiv

33+阅读 · 2022年1月11日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

A Multi-Objective Deep Reinforcement Learning Framework

A Multi-Objective Deep Reinforcement Learning Framework

Arxiv

17+阅读 · 2018年6月27日

相关基金

不确定与动态信息环境下基于预规划-重规划集成建模的应急物流选址-调度鲁棒优化研究

国家自然科学基金

3+阅读 · 2014年12月31日

基于自适应动态规划的非线性系统鲁棒控制与分散镇定

国家自然科学基金

3+阅读 · 2013年12月31日

非线性随机脉冲时滞系统的稳定性分析、控制与滤波及应用

国家自然科学基金

0+阅读 · 2012年12月31日

不连续耦合多智能体时滞网络化系统的协调控制

国家自然科学基金

0+阅读 · 2012年12月31日

基于回声状态网络的有限时间非线性系统自适应最优控制

国家自然科学基金

1+阅读 · 2012年12月31日

Multi-Agent架构智能机器人推理机实时性研究

国家自然科学基金

1+阅读 · 2011年12月31日

水下潜器三维空间路径规划方法与仿真研究

国家自然科学基金

5+阅读 · 2011年12月31日

随机切换系统在异步切换下的鲁棒控制与滤波

国家自然科学基金

0+阅读 · 2009年12月31日

基于局部仿射不变特征的移动机器人单目vSLAM研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于动态分层与自学习的多智能体自适应协作模型

国家自然科学基金

17+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员