潜在空间强化学习在多机器人探索中的应用 (Latent Space Reinforcement Learning for Multi-Robot Exploration) - 专知论文

会员服务 ·

0

潜在 · 算法 · 系统 · 结构 · 自编码器 ·

Latent Space Reinforcement Learning for Multi-Robot Exploration

翻译：潜在空间强化学习在多机器人探索中的应用

Sriram Rajasekar,Ashwini Ratnoo

Autonomous mapping of unknown environments is a critical challenge, particularly in scenarios where time is limited. Multi-agent systems can enhance efficiency through collaboration, but the scalability of motion-planning algorithms remains a key limitation. Reinforcement learning has been explored as a solution, but existing approaches are constrained by the limited input size required for effective learning, restricting their applicability to discrete environments. This work addresses that limitation by leveraging autoencoders to perform dimensionality reduction, compressing high-fidelity occupancy maps into latent state vectors while preserving essential spatial information. Additionally, we introduce a novel procedural generation algorithm based on Perlin noise, designed to generate topologically complex training environments that simulate asteroid fields, caves and forests. These environments are used for training the autoencoder and the navigation algorithm using a hierarchical deep reinforcement learning framework for decentralized coordination. We introduce a weighted consensus mechanism that modulates reliance on shared data via a tuneable trust parameter, ensuring robustness to accumulation of errors. Experimental results demonstrate that the proposed system scales effectively with number of agents and generalizes well to unfamiliar, structurally distinct environments and is resilient in communication-constrained settings.

翻译：未知环境的自主测绘是一个关键挑战，在时间受限的场景中尤为如此。多智能体系统可通过协作提升效率，但运动规划算法的可扩展性仍是主要限制因素。强化学习已被探索作为解决方案，但现有方法受限于有效学习所需的有限输入规模，致使其仅适用于离散环境。本研究通过利用自编码器进行降维来解决这一局限，将高保真占据栅格地图压缩为潜在状态向量，同时保留关键空间信息。此外，我们提出一种基于Perlin噪声的新型程序化生成算法，旨在生成拓扑结构复杂的训练环境，模拟小行星带、洞穴和森林等场景。这些环境用于训练自编码器及导航算法，并采用分层深度强化学习框架实现去中心化协同。我们引入一种加权共识机制，通过可调节的信任参数调控对共享数据的依赖程度，确保对误差累积的鲁棒性。实验结果表明，所提出的系统能随智能体数量有效扩展，对结构迥异的陌生环境具有良好的泛化能力，且在通信受限场景中表现出强韧性。

0

相关内容

机器人领域的多任务泛化研究

机器人领域的多任务泛化研究

专知会员服务

16+阅读 · 1月14日

《基于分层多智能体强化学习的逼真空战协同策略》

《基于分层多智能体强化学习的逼真空战协同策略》

专知会员服务

39+阅读 · 2025年10月30日

《具备集体态势感知能力的深度强化学习智能体在超视距空战中的应用研究》最新文献

《具备集体态势感知能力的深度强化学习智能体在超视距空战中的应用研究》最新文献

专知会员服务

43+阅读 · 2025年9月23日

Nature：大脑中的多时间尺度强化学习

Nature：大脑中的多时间尺度强化学习

专知会员服务

17+阅读 · 2025年6月8日

基于学习机制的多智能体强化学习综述

基于学习机制的多智能体强化学习综述

专知会员服务

61+阅读 · 2024年4月16日

分层强化学习在无人机领域应用综述

分层强化学习在无人机领域应用综述

专知会员服务

53+阅读 · 2024年3月19日

《用于空战机动的分层多智能体强化学习》

《用于空战机动的分层多智能体强化学习》

专知会员服务

66+阅读 · 2023年10月5日

「强化学习在无人车领域」的应用与展望

「强化学习在无人车领域」的应用与展望

专知会员服务

58+阅读 · 2022年12月8日

强化学习在机器人中的应用，附视频与Slides，Animesh Garg, UoT

强化学习在机器人中的应用，附视频与Slides，Animesh Garg, UoT

专知会员服务

37+阅读 · 2022年7月12日

【强化学习研讨会|Microsoft Research】多智能体强化学习 Scalable and Robust Multi-Agent Reinforcement Learning，46页pdf，美国东北大学|Christopher Amato

【强化学习研讨会|Microsoft Research】多智能体强化学习 Scalable and Robust Multi-Agent Reinforcement Learning，46页pdf，美国东北大学|Christopher Amato

专知会员服务

26+阅读 · 2019年10月3日

推荐！【DARPA终身学习机器（L2M）】《自主系统中用于感知和行动的终身学习》美空军、宾大2022最新234页技术报告

推荐！【DARPA终身学习机器（L2M）】《自主系统中用于感知和行动的终身学习》美空军、宾大2022最新234页技术报告

专知

26+阅读 · 2022年11月24日

「基于通信的多智能体强化学习」进展综述

「基于通信的多智能体强化学习」进展综述

专知

32+阅读 · 2022年11月12日

探索(Exploration)还是利用(Exploitation)？强化学习如何tradeoff？

探索(Exploration)还是利用(Exploitation)？强化学习如何tradeoff？

深度强化学习实验室

13+阅读 · 2020年8月23日

强化学习的两大话题之一，仍有极大探索空间

强化学习的两大话题之一，仍有极大探索空间

AI科技评论

22+阅读 · 2020年8月22日

DeepMind综述深度强化学习中的快与慢，智能体应该像人一样学习

DeepMind综述深度强化学习中的快与慢，智能体应该像人一样学习

机器之心

20+阅读 · 2019年5月3日

【强化学习】用于真实机器人的高效深度强化学习算法、全面解读深度强化学习

【强化学习】用于真实机器人的高效深度强化学习算法、全面解读深度强化学习

产业智能官

16+阅读 · 2018年12月27日

【强化学习】叶志豪：介绍强化学习及其在 NLP 上的应用｜分享总结

【强化学习】叶志豪：介绍强化学习及其在 NLP 上的应用｜分享总结

产业智能官

20+阅读 · 2018年7月24日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

强化学习初探 - 从多臂老虎机问题说起

强化学习初探 - 从多臂老虎机问题说起

专知

10+阅读 · 2018年4月3日

【强化学习】强化学习+深度学习=人工智能

【强化学习】强化学习+深度学习=人工智能

产业智能官

55+阅读 · 2017年8月11日

针对大规模环境下复杂任务的策略搜索强化学习方法研究

国家自然科学基金

42+阅读 · 2015年12月31日

面向大规模多步学习问题的学习分类元系统技术研究

国家自然科学基金

5+阅读 · 2015年12月31日

未知环境中移动机器人探索式路径规划方法研究

国家自然科学基金

7+阅读 · 2015年12月31日

面向多源遥感图像的深度学习技术与系统研究

国家自然科学基金

17+阅读 · 2014年12月31日

基于深度学习的特征融合在移动机器人视觉中的场景理解及研究

国家自然科学基金

12+阅读 · 2014年12月31日

基于逆向强化学习和人工智能的移动机器人自主学习方法研究

国家自然科学基金

12+阅读 · 2013年12月31日

基于融合先验知识的机器学习的多传感器融合研究

国家自然科学基金

16+阅读 · 2013年12月31日

基于群体智能的多Agent协作模型与适应性研究

国家自然科学基金

18+阅读 · 2009年12月31日

强化学习关键技术及其在机器人行为学习中的应用

国家自然科学基金

23+阅读 · 2009年12月31日

基于多智能体强化学习的多机器人系统研究

国家自然科学基金

48+阅读 · 2009年12月31日

Reinforcement Learning for Active Perception in Autonomous Navigation

Arxiv

0+阅读 · 2月1日

Learning Reward Functions for Cooperative Resilience in Multi-Agent Systems

Arxiv

0+阅读 · 1月29日

Adapting the Behavior of Reinforcement Learning Agents to Changing Action Spaces and Reward Functions

Arxiv

0+阅读 · 1月28日

Continual Knowledge Adaptation for Reinforcement Learning

Arxiv

0+阅读 · 1月20日

Communication Methods in Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 1月19日

Cooperative Multi-agent RL with Communication Constraints

Arxiv

0+阅读 · 1月18日

Heterogeneous Multi-Expert Reinforcement Learning for Long-Horizon Multi-Goal Tasks in Autonomous Forklifts

Arxiv

0+阅读 · 1月12日

Solving Robotics Tasks with Prior Demonstration via Exploration-Efficient Deep Reinforcement Learning

Arxiv

0+阅读 · 1月8日

Multiagent Reinforcement Learning with Neighbor Action Estimation

Arxiv

0+阅读 · 1月8日

Hybrid Motion Planning with Deep Reinforcement Learning for Mobile Robot Navigation

Arxiv

0+阅读 · 2025年12月31日

VIP会员

文章信息

相关主题

相关VIP内容

机器人领域的多任务泛化研究

机器人领域的多任务泛化研究

专知会员服务

16+阅读 · 1月14日

《基于分层多智能体强化学习的逼真空战协同策略》

《基于分层多智能体强化学习的逼真空战协同策略》

专知会员服务

39+阅读 · 2025年10月30日

《具备集体态势感知能力的深度强化学习智能体在超视距空战中的应用研究》最新文献

《具备集体态势感知能力的深度强化学习智能体在超视距空战中的应用研究》最新文献

专知会员服务

43+阅读 · 2025年9月23日

Nature：大脑中的多时间尺度强化学习

Nature：大脑中的多时间尺度强化学习

专知会员服务

17+阅读 · 2025年6月8日

基于学习机制的多智能体强化学习综述

基于学习机制的多智能体强化学习综述

专知会员服务

61+阅读 · 2024年4月16日

分层强化学习在无人机领域应用综述

分层强化学习在无人机领域应用综述

专知会员服务

53+阅读 · 2024年3月19日

《用于空战机动的分层多智能体强化学习》

《用于空战机动的分层多智能体强化学习》

专知会员服务

66+阅读 · 2023年10月5日

「强化学习在无人车领域」的应用与展望

「强化学习在无人车领域」的应用与展望

专知会员服务

58+阅读 · 2022年12月8日

强化学习在机器人中的应用，附视频与Slides，Animesh Garg, UoT

强化学习在机器人中的应用，附视频与Slides，Animesh Garg, UoT

专知会员服务

37+阅读 · 2022年7月12日

【强化学习研讨会|Microsoft Research】多智能体强化学习 Scalable and Robust Multi-Agent Reinforcement Learning，46页pdf，美国东北大学|Christopher Amato

【强化学习研讨会|Microsoft Research】多智能体强化学习 Scalable and Robust Multi-Agent Reinforcement Learning，46页pdf，美国东北大学|Christopher Amato

专知会员服务

26+阅读 · 2019年10月3日

热门VIP内容

开通专知VIP会员享更多权益服务

美国防部门开始扩建金穹反导系统基础设施

《基于选择性深度神经网络分类的弹性无线通信》最新报告

《多域作战中融合网络、电子战与动能机动》

《在东欧磨砺反无人机技能》美陆军最新反无人机训练报告

相关资讯

推荐！【DARPA终身学习机器（L2M）】《自主系统中用于感知和行动的终身学习》美空军、宾大2022最新234页技术报告

推荐！【DARPA终身学习机器（L2M）】《自主系统中用于感知和行动的终身学习》美空军、宾大2022最新234页技术报告

专知

26+阅读 · 2022年11月24日

「基于通信的多智能体强化学习」进展综述

「基于通信的多智能体强化学习」进展综述

专知

32+阅读 · 2022年11月12日

探索(Exploration)还是利用(Exploitation)？强化学习如何tradeoff？

探索(Exploration)还是利用(Exploitation)？强化学习如何tradeoff？

深度强化学习实验室

13+阅读 · 2020年8月23日

强化学习的两大话题之一，仍有极大探索空间

强化学习的两大话题之一，仍有极大探索空间

AI科技评论

22+阅读 · 2020年8月22日

DeepMind综述深度强化学习中的快与慢，智能体应该像人一样学习

DeepMind综述深度强化学习中的快与慢，智能体应该像人一样学习

机器之心

20+阅读 · 2019年5月3日

【强化学习】用于真实机器人的高效深度强化学习算法、全面解读深度强化学习

【强化学习】用于真实机器人的高效深度强化学习算法、全面解读深度强化学习

产业智能官

16+阅读 · 2018年12月27日

【强化学习】叶志豪：介绍强化学习及其在 NLP 上的应用｜分享总结

【强化学习】叶志豪：介绍强化学习及其在 NLP 上的应用｜分享总结

产业智能官

20+阅读 · 2018年7月24日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

强化学习初探 - 从多臂老虎机问题说起

强化学习初探 - 从多臂老虎机问题说起

专知

10+阅读 · 2018年4月3日

【强化学习】强化学习+深度学习=人工智能

【强化学习】强化学习+深度学习=人工智能

产业智能官

55+阅读 · 2017年8月11日

相关论文

Reinforcement Learning for Active Perception in Autonomous Navigation

Arxiv

0+阅读 · 2月1日

Learning Reward Functions for Cooperative Resilience in Multi-Agent Systems

Arxiv

0+阅读 · 1月29日

Adapting the Behavior of Reinforcement Learning Agents to Changing Action Spaces and Reward Functions

Arxiv

0+阅读 · 1月28日

Continual Knowledge Adaptation for Reinforcement Learning

Arxiv

0+阅读 · 1月20日

Communication Methods in Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 1月19日

Cooperative Multi-agent RL with Communication Constraints

Arxiv

0+阅读 · 1月18日

Heterogeneous Multi-Expert Reinforcement Learning for Long-Horizon Multi-Goal Tasks in Autonomous Forklifts

Arxiv

0+阅读 · 1月12日

Solving Robotics Tasks with Prior Demonstration via Exploration-Efficient Deep Reinforcement Learning

Arxiv

0+阅读 · 1月8日

Multiagent Reinforcement Learning with Neighbor Action Estimation

Arxiv

0+阅读 · 1月8日

Hybrid Motion Planning with Deep Reinforcement Learning for Mobile Robot Navigation

Arxiv

0+阅读 · 2025年12月31日

相关基金

针对大规模环境下复杂任务的策略搜索强化学习方法研究

国家自然科学基金

42+阅读 · 2015年12月31日

面向大规模多步学习问题的学习分类元系统技术研究

国家自然科学基金

5+阅读 · 2015年12月31日

未知环境中移动机器人探索式路径规划方法研究

国家自然科学基金

7+阅读 · 2015年12月31日

面向多源遥感图像的深度学习技术与系统研究

国家自然科学基金

17+阅读 · 2014年12月31日

基于深度学习的特征融合在移动机器人视觉中的场景理解及研究

国家自然科学基金

12+阅读 · 2014年12月31日

基于逆向强化学习和人工智能的移动机器人自主学习方法研究

国家自然科学基金

12+阅读 · 2013年12月31日

基于融合先验知识的机器学习的多传感器融合研究

国家自然科学基金

16+阅读 · 2013年12月31日

基于群体智能的多Agent协作模型与适应性研究

国家自然科学基金

18+阅读 · 2009年12月31日

强化学习关键技术及其在机器人行为学习中的应用

国家自然科学基金

23+阅读 · 2009年12月31日

基于多智能体强化学习的多机器人系统研究

国家自然科学基金

48+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员