Partial Attention in Deep Reinforcement Learning for Safe Multi-Agent Control

Attention mechanisms excel at learning sequential patterns by discriminating data based on relevance and importance. This provides state-of-the-art performance in advanced generative artificial intelligence models. This paper applies this concept of an attention mechanism for multi-agent safe control. We specifically consider the design of a neural network to control autonomous vehicles in a highway merging scenario. The environment is modeled as a Decentralized Partially Observable Markov Decision Process (Dec-POMDP). Within a QMIX framework, we include partial attention for each autonomous vehicle, thus allowing each ego vehicle to focus on the most relevant neighboring vehicles. Moreover, we propose a comprehensive reward signal that considers the global objectives of the environment (e.g., safety and vehicle flow) and the individual interests of each agent. Simulations are conducted in the Simulation of Urban Mobility (SUMO). The results show better performance compared to other driving algorithms in terms of safety, driving speed, and reward.

翻译：注意力机制通过根据数据的相关性和重要性对其进行区分，在序列模式学习方面表现出色。这为先进生成式人工智能模型提供了最先进的性能。本文将该注意力机制的概念应用于多智能体安全控制。我们特别考虑了在高速公路合流场景中设计用于控制自动驾驶汽车的神经网络。环境被建模为分散式部分可观测马尔可夫决策过程（Dec-POMDP）。在QMIX框架内，我们为每辆自动驾驶汽车引入部分注意力，从而允许每辆自车关注最相关的邻近车辆。此外，我们提出了一种综合考虑环境全局目标（例如安全性和车辆流量）以及每个智能体个体利益的综合奖励信号。在“城市移动性仿真”（SUMO）中进行了仿真实验。结果表明，在安全性、行驶速度和奖励方面，该方法优于其他驾驶算法。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

《多智能体强化学习中的机制设计优化研究》103页

专知会员服务

33+阅读 · 2025年5月31日

《多智能体强化学习中机制设计的优化》103页

专知会员服务

31+阅读 · 2025年5月3日

扩散模型中的注意力机制：综述

专知会员服务

24+阅读 · 2025年4月10日

基于学习机制的多智能体强化学习综述

专知会员服务

63+阅读 · 2024年4月16日