Automotive Perception Software Development: An Empirical Investigation into Data, Annotation, and Ecosystem Challenges

Software that contains machine learning algorithms is an integral part of automotive perception, for example, in driving automation systems. The development of such software, specifically the training and validation of the machine learning components, require large annotated datasets. An industry of data and annotation services has emerged to serve the development of such data-intensive automotive software components. Wide-spread difficulties to specify data and annotation needs challenge collaborations between OEMs (Original Equipment Manufacturers) and their suppliers of software components, data, and annotations. This paper investigates the reasons for these difficulties for practitioners in the Swedish automotive industry to arrive at clear specifications for data and annotations. The results from an interview study show that a lack of effective metrics for data quality aspects, ambiguities in the way of working, unclear definitions of annotation quality, and deficits in the business ecosystems are causes for the difficulty in deriving the specifications. We provide a list of recommendations that can mitigate challenges when deriving specifications and we propose future research opportunities to overcome these challenges. Our work contributes towards the on-going research on accountability of machine learning as applied to complex software systems, especially for high-stake applications such as automated driving.

翻译：包含机器学习算法的软件是汽车感知系统不可或缺的组成部分，例如在驾驶自动化系统中。此类软件的开发，特别是机器学习组件的训练与验证，需要大规模标注数据集。为满足数据密集型汽车软件组件的开发需求，数据与标注服务行业应运而生。然而，在数据与标注需求的规范化方面普遍存在的困难，给原始设备制造商（OEM）及其软件组件、数据与标注供应商之间的协作带来了挑战。本文以瑞典汽车行业从业者为对象，探究了其在制定明确数据与标注规范时面临困难的根源。访谈研究结果表明：有效数据质量度量指标的缺乏、工作方式的模糊性、标注质量定义的不明确以及商业生态系统中的缺陷，是导致规范制定困难的主要原因。我们提出了一系列缓解规范制定难题的建议，并指出了克服这些挑战的未来研究方向。本研究有助于推进对复杂软件系统中机器学习可问责性的持续探究，尤其适用于自动驾驶等高利害应用场景。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日