Multi-Agent Path Finding (MAPF) in crowded environments presents a challenging problem in motion planning, aiming to find collision-free paths for all agents in the system. MAPF finds a wide range of applications in various domains, including aerial swarms, autonomous warehouse robotics, and self-driving vehicles. Current approaches to MAPF generally fall into two main categories: centralized and decentralized planning. Centralized planning suffers from the curse of dimensionality when the number of agents or states increases and thus does not scale well in large and complex environments. On the other hand, decentralized planning enables agents to engage in real-time path planning within a partially observable environment, demonstrating implicit coordination. However, they suffer from slow convergence and performance degradation in dense environments. In this paper, we introduce CRAMP, a novel crowd-aware decentralized reinforcement learning approach to address this problem by enabling efficient local communication among agents via Graph Neural Networks (GNNs), facilitating situational awareness and decision-making capabilities in congested environments. We test CRAMP on simulated environments and demonstrate that our method outperforms the state-of-the-art decentralized methods for MAPF on various metrics. CRAMP improves the solution quality up to 59% measured in makespan and collision count, and up to 35% improvement in success rate in comparison to previous methods.
翻译:在拥挤环境下的多智能体路径规划(MAPF)是运动规划领域的一个具有挑战性的问题,其目标是为系统中的所有智能体寻找无碰撞路径。MAPF在多个领域有着广泛的应用,包括无人机集群、自主仓储机器人和自动驾驶车辆。当前的MAPF方法主要分为两大类:集中式规划和分散式规划。当智能体数量或状态空间增加时,集中式规划会遭受维度灾难,因此在大型复杂环境中扩展性不佳。另一方面,分散式规划使智能体能够在部分可观测环境中进行实时路径规划,展现出隐式的协调能力。然而,在密集环境中,它们存在收敛速度慢和性能下降的问题。本文提出了CRAMP,一种新颖的群体感知分散式强化学习方法,通过利用图神经网络(GNNs)实现智能体间高效的局部通信,来解决这一问题,从而增强在拥堵环境中的态势感知和决策能力。我们在模拟环境中测试了CRAMP,并证明我们的方法在各种指标上优于最先进的MAPF分散式方法。与先前方法相比,CRAMP在完工时间和碰撞次数方面将解的质量提升了高达59%,在成功率方面提升了高达35%。