Perception is a cornerstone of autonomous driving, enabling vehicles to understand their surroundings and make safe, reliable decisions. Developing robust perception algorithms requires large-scale, high-quality datasets that cover diverse driving conditions and support thorough evaluation. Existing datasets often lack a high-fidelity digital twin, limiting systematic testing, edge-case simulation, sensor modification, and sim-to-real evaluation. To address this gap, we present DrivIng, a large-scale multimodal dataset with a complete geo-referenced digital twin of an ~18 km route spanning urban, suburban, and highway segments. Our dataset provides continuous recordings from six RGB cameras, one LiDAR, and high-precision ADMA-based localization, captured across day, dusk, and night. All sequences are annotated at 10 Hz with 3D bounding boxes and track IDs across 12 classes, yielding ~1.2 million annotated instances. Beyond the benefits of the digital twin itself, DrivIng enables a 1-to-1 transfer of real traffic into simulation, preserving agent interactions while enabling realistic and flexible scenario testing. To support reproducible research and robust validation, we benchmark DrivIng with state-of-the-art perception models and publicly release the dataset, digital twin, HD map, and codebase.