We study the problem of long-term (multi-day) mapping of a river plume using multiple autonomous underwater vehicles (AUVs), focusing on the Douro river as a representative use case. We propose an energy- and communication-efficient multi-agent reinforcement learning approach in which a central coordinator intermittently communicates with the AUVs, collecting measurements and issuing commands. Our approach integrates spatiotemporal Gaussian process regression (GPR) with a multi-head Q-network controller that regulates the direction and speed of each AUV. Simulations using the Delft3D ocean model demonstrate that our method consistently outperforms both single- and multi-agent benchmarks, with increases in the number of agents improving both mean squared error (MSE) and operational endurance. In some instances, doubling the number of AUVs more than doubles endurance while maintaining or improving accuracy, underscoring the benefits of multi-agent coordination. Our learned policies generalize across unseen seasonal regimes spanning different months and years, demonstrating promise for future development of data-driven long-term monitoring of dynamic plume environments.