V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting

Haibao Yu,Wenxian Yang,Hongzhi Ruan,Zhenwei Yang,Yingjuan Tang,Xu Gao,Xin Hao,Yifeng Shi,Yifeng Pan,Ning Sun,Juan Song,Jirui Yuan,Ping Luo,Zaiqing Nie

from arxiv, CVPR2023

Utilizing infrastructure and vehicle-side information to track and forecast the behaviors of surrounding traffic participants can significantly improve decision-making and safety in autonomous driving. However, the lack of real-world sequential datasets limits research in this area. To address this issue, we introduce V2X-Seq, the first large-scale sequential V2X dataset, which includes data frames, trajectories, vector maps, and traffic lights captured from natural scenery. V2X-Seq comprises two parts: the sequential perception dataset, which includes more than 15,000 frames captured from 95 scenarios, and the trajectory forecasting dataset, which contains about 80,000 infrastructure-view scenarios, 80,000 vehicle-view scenarios, and 50,000 cooperative-view scenarios captured from 28 intersections' areas, covering 672 hours of data. Based on V2X-Seq, we introduce three new tasks for vehicle-infrastructure cooperative (VIC) autonomous driving: VIC3D Tracking, Online-VIC Forecasting, and Offline-VIC Forecasting. We also provide benchmarks for the introduced tasks. Find data, code, and more up-to-date information at \href{https://github.com/AIR-THU/DAIR-V2X-Seq}{https://github.com/AIR-THU/DAIR-V2X-Seq}.

翻译：利用基础设施和车辆侧信息跟踪与预测周围交通参与者的行为，可显著提升自动驾驶的决策能力和安全性。然而，真实世界序列数据集的匮乏限制了该领域的研究进展。针对这一问题，我们提出了V2X-Seq——首个大规模序列车路协同数据集，包含从自然场景中采集的数据帧、轨迹、矢量地图及交通灯信息。V2X-Seq由两部分组成：序列感知数据集（涵盖95个场景的15,000余帧数据）和轨迹预测数据集（包含从28个路口区域采集的约80,000个基础设施视角场景、80,000个车辆视角场景及50,000个协同视角场景，覆盖672小时数据）。基于V2X-Seq，我们提出了车路协同自动驾驶的三项新任务：VIC3D跟踪、在线VIC预测和离线VIC预测，并提供了相应基准。数据和代码及更多最新信息详见 \href{https://github.com/AIR-THU/DAIR-V2X-Seq}{https://github.com/AIR-THU/DAIR-V2X-Seq}。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

59+阅读 · 2020年1月25日