Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation for autonomous vehicles

Autonomous driving (AD) perception today relies heavily on deep learning based architectures requiring large scale annotated datasets with their associated costs for curation and annotation. The 3D semantic data are useful for core perception tasks such as obstacle detection and ego-vehicle localization. We propose a new dataset, Navya 3D Segmentation (Navya3DSeg), with a diverse label space corresponding to a large scale production grade operational domain, including rural, urban, industrial sites and universities from 13 countries. It contains 23 labeled sequences and 25 supplementary sequences without labels, designed to explore self-supervised and semi-supervised semantic segmentation benchmarks on point clouds. We also propose a novel method for sequential dataset split generation based on iterative multi-label stratification, and demonstrated to achieve a +1.2% mIoU improvement over the original split proposed by SemanticKITTI dataset. A complete benchmark for semantic segmentation task was performed, with state of the art methods. Finally, we demonstrate an active learning (AL) based dataset distillation framework. We introduce a novel heuristic-free sampling method called distance sampling in the context of AL. A detailed presentation on the dataset is available at https://www.youtube.com/watch?v=5m6ALIs-s20 .

翻译：自主驾驶感知技术当前高度依赖基于深度学习的架构，此类架构需要大规模标注数据集，并伴随相应的数据整理与标注成本。三维语义数据在障碍物检测、自车定位等核心感知任务中具有重要作用。我们提出一个新数据集Navya三维分割（Navya3DSeg），其包含丰富的标签空间，覆盖来自13个国家的乡村、城区、工业园区及大学等大规模生产级运营场景。该数据集包含23个已标注序列与25个无标注补充序列，旨在探索基于点云的自监督与半监督语义分割基准测试。我们还提出一种基于迭代多标签分层策略的序列数据集分割生成新方法，实验表明该方法相较于SemanticKITTI数据集原始分割方案可获得平均交并比（mIoU）1.2%的提升。我们利用现有最优方法完成了完整的语义分割任务基准测试。最后，我们提出一种基于主动学习的数据集精炼框架，并在该框架中引入一种名为距离采样的无启发式采样方法。数据集详细介绍可参阅 https://www.youtube.com/watch?v=5m6ALIs-s20。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

专知会员服务

18+阅读 · 2022年3月19日

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

专知会员服务

18+阅读 · 2022年3月19日

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

专知会员服务

15+阅读 · 2022年3月12日

【CVPR 2022】基于Tracklet查询和建议的高效视频实例分割，Efficient Video Instance Segmentation via Tracklet Query and Proposal

专知会员服务

16+阅读 · 2022年3月3日