Sensor to Pixels: Decentralized Swarm Gathering via Image-Based Reinforcement Learning


Agents · Reinforcement Learning · Sensing · Extraction · Sensors


Yigal Koifman, Eran Iceland, Erez Koifman, Ariel Barel, Alfred M. Bruckstein

This study highlights the potential of image-based reinforcement learning for swarm-related tasks. In multi-agent reinforcement learning, effective policy learning depends on how agents sense, interpret, and process their inputs. Traditional approaches often rely on handcrafted feature extraction or raw vector-based representations, which limit the scalability and efficiency of learned policies with respect to input order and size. In this work, we propose an image-based reinforcement learning method for decentralized control of a multi-agent system. Observations are encoded as structured visual inputs that neural networks can process, extracting their spatial features and producing novel decentralized motion control rules. We evaluate the approach on a multi-agent convergence task in which agents with limited-range, bearing-only sensing aim to keep the swarm cohesive while aggregating. Performance is compared against two benchmarks: an analytical solution proposed by Bellaiche and Bruckstein, which guarantees convergence but progresses slowly, and VariAntNet, a neural-network-based framework that converges much faster but shows only medium success rates in hard constellations. Our method achieves high convergence, with a pace nearly matching that of VariAntNet. In some scenarios, it serves as the only practical alternative.
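To make the idea of encoding observations as structured visual inputs concrete, here is a minimal Python sketch. It is an assumption for illustration only and not the authors' encoding, which the abstract does not specify. The hypothetical function `bearings_to_image` rasterizes one agent's bearing-only observations into a fixed-size egocentric image. Range is unknown, so each neighbor is drawn as a full ray from the image centre. The image size and the ray drawing are illustrative choices.

```python
import numpy as np


def bearings_to_image(bearings_rad, size=32):
    """Rasterize bearing-only observations into a size x size egocentric image.

    Hypothetical encoding, for illustration only: each visible neighbor marks
    the pixels along a ray from the image centre in the direction of its
    bearing. Distance is not observed, so the whole ray is drawn.
    """
    img = np.zeros((size, size), dtype=np.float32)
    centre = (size - 1) / 2.0
    # Sample points along the ray, from just off-centre to the image border.
    radii = np.linspace(1.0, centre, num=size)
    for theta in bearings_rad:
        xs = np.rint(centre + radii * np.cos(theta)).astype(int)
        ys = np.rint(centre + radii * np.sin(theta)).astype(int)
        img[ys, xs] = 1.0
    return img


# The output shape is fixed regardless of neighbor count, and the result
# does not depend on the order in which neighbors are listed.
few = bearings_to_image([0.3])
many = bearings_to_image([0.3, 1.7, 2.9, 4.4, 5.8])
shuffled = bearings_to_image([4.4, 0.3, 5.8, 2.9, 1.7])
```

In this sketch, `few` and `many` have the same shape, and `many` equals `shuffled`. This illustrates how an image input can sidestep the input-order and input-size issues the abstract attributes to vector-based representations. An image of this kind could then be passed to a convolutional policy network.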

