Real-time Cooperative Vehicle Coordination at Unsignalized Road Intersections

Cooperative coordination at unsignalized road intersections, which aims to improve driving safety and traffic throughput for connected and automated vehicles, has attracted increasing interest in recent years. However, most existing approaches either suffer from high computational complexity or cannot harness the full potential of the road infrastructure. To this end, we first present a dedicated intersection coordination framework in which the involved vehicles hand over their control authority and follow instructions from a centralized coordinator. We then formulate a unified cooperative trajectory optimization problem that maximizes traffic throughput while ensuring driving safety and the long-term stability of the coordination system. To address the key computational challenges of real-world deployment, we reformulate this non-convex sequential decision problem as a model-free Markov Decision Process (MDP) and solve it with a strategy based on Twin Delayed Deep Deterministic Policy Gradient (TD3) within the deep reinforcement learning (DRL) framework. Simulation and practical experiments show that the proposed strategy achieves near-optimal performance in sub-static coordination scenarios and significantly improves traffic throughput in realistic continuous traffic flow. Its most notable advantage is that it reduces the computation time to the order of milliseconds, and it is shown to scale as the number of road lanes increases.

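To make the TD3 step mentioned in the abstract more concrete, the following is a minimal sketch of how a standard TD3 critic target is usually computed (clipped double-Q learning with target policy smoothing). It is not taken from the paper: the abstract does not give the state, action, or reward design, so `td3_target`, the callables `target_actor`, `target_q1`, `target_q2`, and all default hyperparameters are assumptions chosen for illustration only.

```python
import random
from typing import Callable, List, Optional, Sequence

State = Sequence[float]
Action = Sequence[float]


def clip(x: float, lo: float, hi: float) -> float:
    """Clamp x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def td3_target(
    reward: float,
    next_state: State,
    done: bool,
    target_actor: Callable[[State], Action],      # hypothetical target policy
    target_q1: Callable[[State, Action], float],  # hypothetical target critic 1
    target_q2: Callable[[State, Action], float],  # hypothetical target critic 2
    gamma: float = 0.99,        # discount factor (assumed value)
    noise_std: float = 0.2,     # std of the smoothing noise (assumed value)
    noise_clip: float = 0.5,    # bound on the smoothing noise (assumed value)
    action_low: float = -1.0,   # assumed normalized action bounds
    action_high: float = 1.0,
    rng: Optional[random.Random] = None,
) -> float:
    """Return the generic TD3 regression target for one transition."""
    if done:
        # No bootstrapping from a terminal state.
        return reward
    rng = rng or random.Random()
    # Target policy smoothing: add clipped Gaussian noise to the target
    # action, then clip the result to the valid action range.
    smoothed: List[float] = [
        clip(
            a + clip(rng.gauss(0.0, noise_std), -noise_clip, noise_clip),
            action_low,
            action_high,
        )
        for a in target_actor(next_state)
    ]
    # Clipped double-Q: bootstrap from the smaller of the two critics
    # to reduce overestimation bias.
    q_min = min(target_q1(next_state, smoothed), target_q2(next_state, smoothed))
    return reward + gamma * q_min
```

In a coordination setting such as the one described above, the action vector could, for example, hold one normalized acceleration command per vehicle, but that mapping is an assumption here rather than something the abstract states.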
