What Matters to Enhance Traffic Rule Compliance of Imitation Learning for Automated Driving

End-to-end autonomous driving, in which the entire driving pipeline is replaced with a single neural network, has recently attracted more research attention because of its simpler structure and faster inference. Although this approach greatly reduces the number of components in the driving pipeline, its simplicity also leads to interpretability problems and safety issues. The trained policy does not always comply with traffic rules, and the lack of intermediate outputs makes it hard to find the cause of the misbehavior. Sensors are also critical to the safety and feasibility of autonomous driving, since they are how the vehicle perceives its surroundings in complex driving scenarios. In this paper, we propose P-CSG, a penalty-based imitation learning approach combined with cross semantics generation sensor fusion, to increase the overall performance of end-to-end autonomous driving. The method introduces three penalties - a red light penalty, a stop sign penalty, and a curvature speed penalty - to make the agent more sensitive to traffic rules. The proposed cross semantics generation helps to align the information shared between different input modalities. We assess our model on the CARLA leaderboard Town 05 Long benchmark and the Longest6 benchmark, where it achieves a clear improvement in driving score. Furthermore, we evaluate robustness against adversarial attacks such as FGSM and Dot attacks, and find a substantial increase in robustness compared to baseline models. More detailed information, such as the code base and videos, can be found at https://hk-zh.github.io/p-csg-plus.
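For readers unfamiliar with FGSM (the Fast Gradient Sign Method mentioned above), the attack perturbs an input by a small step `eps` in the direction of the sign of the loss gradient with respect to that input. The following is a minimal sketch on a toy logistic-regression classifier, not the driving model or evaluation setup used in the paper; the weights, input, and all function names (`predict`, `bce_loss`, `fgsm`) are hypothetical and chosen only for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(w, b, x):
    # Toy logistic-regression model (an assumption, not the paper's network).
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def bce_loss(w, b, x, y):
    # Binary cross-entropy loss for a single example with label y in {0, 1}.
    p = predict(w, b, x)
    return -(y * math.log(p) + (1.0 - y) * math.log(1.0 - p))


def sign(v):
    # Returns -1, 0, or 1.
    return (v > 0) - (v < 0)


def fgsm(w, b, x, y, eps):
    # For logistic regression with BCE loss, d(loss)/dx = (p - y) * w.
    p = predict(w, b, x)
    grad_x = [(p - y) * wi for wi in w]
    # FGSM step: move each input coordinate by eps along the gradient sign.
    return [xi + eps * sign(g) for xi, g in zip(x, grad_x)]


# Hypothetical model parameters and input.
w = [1.5, -2.0, 0.5]
b = 0.1
x = [0.2, -0.4, 1.0]
y = 1.0

x_adv = fgsm(w, b, x, y, eps=0.1)
clean_loss = bce_loss(w, b, x, y)
adv_loss = bce_loss(w, b, x_adv, y)
```

In this sketch the perturbation is bounded by `eps` in every coordinate, and the loss on `x_adv` is higher than on `x`. A robustness evaluation of the kind described in the abstract would apply the same idea to the sensor inputs of the driving policy, using automatic differentiation instead of a hand-derived gradient.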
