Zenseact Open Dataset: A large-scale and diverse multimodal dataset for autonomous driving

Existing datasets for autonomous driving (AD) often lack diversity and long-range capabilities, focusing instead on 360{\deg} perception and temporal reasoning. To address this gap, we introduce Zenseact Open Dataset (ZOD), a large-scale and diverse multimodal dataset collected over two years in various European countries, covering an area 9x that of existing datasets. ZOD boasts the highest range and resolution sensors among comparable datasets, coupled with detailed keyframe annotations for 2D and 3D objects (up to 245m), road instance/semantic segmentation, traffic sign recognition, and road classification. We believe that this unique combination will facilitate breakthroughs in long-range perception and multi-task learning. The dataset is composed of Frames, Sequences, and Drives, designed to encompass both data diversity and support for spatio-temporal learning, sensor fusion, localization, and mapping. Frames consist of 100k curated camera images with two seconds of other supporting sensor data, while the 1473 Sequences and 29 Drives include the entire sensor suite for 20 seconds and a few minutes, respectively. ZOD is the only large-scale AD dataset released under a permissive license, allowing for both research and commercial use. The dataset is accompanied by an extensive development kit. Data and more information are available online (https://zod.zenseact.com).

翻译：现有的自动驾驶数据集往往缺乏多样性和远距离感知能力，而是聚焦于360度感知和时间推理。为填补这一空白，我们提出了Zenseact开放数据集（ZOD），这是一个跨两年在多个欧洲国家收集的大规模多样化多模态数据集，覆盖面积是现有数据集的9倍。ZOD配备了同类数据集中最高射程和最高分辨率的传感器，并包含详细的2D和3D物体（最远达245米）关键帧标注、道路实例/语义分割、交通标志识别以及道路分类。我们相信，这一独特组合将推动远距离感知和多任务学习领域的突破。该数据集由帧（Frames）、序列（Sequences）和行程（Drives）三部分组成，旨在兼顾数据多样性并支持时空学习、传感器融合、定位与建图。帧包含10万张精选相机图像及两秒的辅助传感器数据；1473个序列和29个行程则分别包含20秒和数分钟的全套传感器数据。ZOD是唯一采用宽松许可协议发布的大规模自动驾驶数据集，允许研究及商业用途。该数据集附带丰富的开发工具包，数据及更多信息可通过在线渠道获取。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

59+阅读 · 2020年1月25日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日