Semantic segmentation is a common task in autonomous driving to understand the surrounding environment. Driveable Area Segmentation and Lane Detection are particularly important for safe and efficient navigation on the road. However, original semantic segmentation models are computationally expensive and require high-end hardware, which is not feasible for embedded systems in autonomous vehicles. This paper proposes a lightweight model for the driveable area and lane line segmentation. TwinLiteNet is designed cheaply but achieves accurate and efficient segmentation results. We evaluate TwinLiteNet on the BDD100K dataset and compare it with modern models. Experimental results show that our TwinLiteNet performs similarly to existing approaches, requiring significantly fewer computational resources. Specifically, TwinLiteNet achieves a mIoU score of 91.3% for the Drivable Area task and 31.08% IoU for the Lane Detection task with only 0.4 million parameters and achieves 415 FPS on GPU RTX A5000. Furthermore, TwinLiteNet can run in real-time on embedded devices with limited computing power, especially since it achieves 60FPS on Jetson Xavier NX, making it an ideal solution for self-driving vehicles. Code is available: url{https://github.com/chequanghuy/TwinLiteNet}.
翻译:语义分割是自动驾驶中理解周围环境的常见任务。其中,可行驶区域分割与车道检测对于道路上的安全高效导航尤为重要。然而,传统语义分割模型计算成本高且需要高端硬件,难以在自动驾驶汽车的嵌入式系统中实现。本文提出一种针对可行驶区域与车道线分割的轻量级模型。TwinLiteNet 设计廉价但能实现精确高效的分割结果。我们在 BDD100K 数据集上评估 TwinLiteNet,并与现代模型进行对比。实验结果表明,TwinLiteNet 与现有方法性能相当,但所需计算资源显著减少。具体而言,TwinLiteNet 在可行驶区域任务上达到 91.3% 的 mIoU 分数,在车道检测任务上达到 31.08% 的 IoU,仅需 0.4 百万参数,在 GPU RTX A5000 上实现 415 FPS。此外,TwinLiteNet 能在计算能力有限的嵌入式设备上实时运行,尤其是在 Jetson Xavier NX 上达到 60 FPS,使其成为自动驾驶汽车的理想解决方案。代码已开源:url{https://github.com/chequanghuy/TwinLiteNet}。