Reward-Driven Automated Curriculum Learning for Interaction-Aware Self-Driving at Unsignalized Intersections

In this work, we present a reward-driven automated curriculum reinforcement learning approach for interaction-aware self-driving at unsignalized intersections, taking into account the uncertainties associated with surrounding vehicles (SVs). These uncertainties encompass the uncertainty of SVs' driving intention and also the quantity of SVs. To deal with this problem, the curriculum set is specifically designed to accommodate a progressively increasing number of SVs. By implementing an automated curriculum selection mechanism, the importance weights are rationally allocated across various curricula, thereby facilitating improved sample efficiency and training outcomes. Furthermore, the reward function is meticulously designed to guide the agent towards effective policy exploration. Thus the proposed framework could proactively address the above uncertainties at unsignalized intersections by employing the automated curriculum learning technique that progressively increases task difficulty, and this ensures safe self-driving through effective interaction with SVs. Comparative experiments are conducted in $Highway\_Env$, and the results indicate that our approach achieves the highest task success rate, attains strong robustness to initialization parameters of the curriculum selection module, and exhibits superior adaptability to diverse situational configurations at unsignalized intersections. Furthermore, the effectiveness of the proposed method is validated using the high-fidelity CARLA simulator.

翻译：在本文中，我们提出了一种奖励驱动的自动课程强化学习方法，用于无信号交叉口的交互感知自动驾驶，该方法考虑了周围车辆（SVs）相关的不确定性。这些不确定性包括SVs驾驶意图的不确定性以及SVs数量的不确定性。为解决该问题，课程集被专门设计为容纳逐渐增多的SVs数量。通过实施自动课程选择机制，重要性权重在不同课程间得到合理分配，从而提升了样本效率和训练效果。此外，奖励函数被精心设计以引导智能体进行有效的策略探索。因此，所提出的框架能够通过采用任务难度逐步增加的自动课程学习技术，主动应对无信号交叉口的上述不确定性，并通过与SVs的有效交互确保安全自动驾驶。在$Highway\_Env$中进行了对比实验，结果表明，我们的方法取得了最高的任务成功率，对课程选择模块的初始化参数具有强鲁棒性，并在无信号交叉口的多种情景配置下展现出优越的适应性。此外，通过高保真度CARLA模拟器验证了所提方法的有效性。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日