In this work, we introduce Scribbles for All, a label and training data generation algorithm for semantic segmentation trained on scribble labels. Training or fine-tuning semantic segmentation models with weak supervision has become an important topic recently and was subject to significant advances in model quality. In this setting, scribbles are a promising label type to achieve high quality segmentation results while requiring a much lower annotation effort than usual pixel-wise dense semantic segmentation annotations. The main limitation of scribbles as source for weak supervision is the lack of challenging datasets for scribble segmentation, which hinders the development of novel methods and conclusive evaluations. To overcome this limitation, Scribbles for All provides scribble labels for several popular segmentation datasets and provides an algorithm to automatically generate scribble labels for any dataset with dense annotations, paving the way for new insights and model advancements in the field of weakly supervised segmentation. In addition to providing datasets and algorithm, we evaluate state-of-the-art segmentation models on our datasets and show that models trained with our synthetic labels perform competitively with respect to models trained on manual labels. Thus, our datasets enable state-of-the-art research into methods for scribble-labeled semantic segmentation. The datasets, scribble generation algorithm, and baselines are publicly available at https://github.com/wbkit/Scribbles4All
翻译:本研究提出Scribbles for All——一种基于涂鸦标注训练语义分割模型的标签与训练数据生成算法。近年来,利用弱监督训练或微调语义分割模型已成为重要研究方向,并在模型质量方面取得显著进展。在此背景下,涂鸦作为一种标注形式,能以远低于常规像素级密集语义标注的标注成本获得高质量分割结果。涂鸦作为弱监督源的主要局限在于缺乏具有挑战性的涂鸦分割数据集,这阻碍了新方法的开发与结论性评估。为突破此限制,Scribbles for All为多个主流分割数据集提供涂鸦标注,并提出可自动为任何具有密集标注的数据集生成涂鸦标签的算法,为弱监督分割领域的新发现与模型进展铺平道路。除提供数据集与算法外,我们在自建数据集上评估了前沿分割模型,结果表明使用合成标签训练的模型与人工标注训练的模型相比具有竞争力。因此,本数据集能够支持基于涂鸦标注的语义分割方法的前沿研究。数据集、涂鸦生成算法及基线模型已在https://github.com/wbkit/Scribbles4All 公开。