Traffic signal control is safety-critical for our daily life. Roughly one-quarter of road accidents in the U.S. happen at intersections due to problematic signal timing, urging the development of safety-oriented intersection control. However, existing studies on adaptive traffic signal control using reinforcement learning technologies have focused mainly on minimizing traffic delay but neglecting the potential exposure to unsafe conditions. We, for the first time, incorporate road safety standards as enforcement to ensure the safety of existing reinforcement learning methods, aiming toward operating intersections with zero collisions. We have proposed a safety-enhanced residual reinforcement learning method (SafeLight) and employed multiple optimization techniques, such as multi-objective loss function and reward shaping for better knowledge integration. Extensive experiments are conducted using both synthetic and real-world benchmark datasets. Results show that our method can significantly reduce collisions while increasing traffic mobility.
翻译:交通信号控制对我们的日常生活至关重要。在美国,约四分之一的道路交通事故发生在交叉路口,其原因是信号配时不当,这迫切要求发展以安全为导向的交叉路口控制方法。然而,现有利用强化学习技术的自适应交通信号控制研究主要侧重于最小化交通延误,而忽略了潜在的不安全风险。我们首次将道路安全标准作为约束条件融入现有强化学习方法,以确保其安全性,旨在实现零碰撞的交叉路口运行。我们提出了一种增强安全性的残差强化学习方法(SafeLight),并采用了多种优化技术,如多目标损失函数和奖励塑形,以实现更好的知识融合。基于合成数据集和真实世界基准数据集的大量实验表明,我们的方法能够在提升交通流动性的同时显著减少碰撞。