Data-centric artificial intelligence (data-centric AI) represents an emerging paradigm emphasizing that the systematic design and engineering of data is essential for building effective and efficient AI-based systems. The objective of this article is to introduce practitioners and researchers from the field of Information Systems (IS) to data-centric AI. We define relevant terms, provide key characteristics that contrast the data-centric paradigm with the model-centric one, and introduce a framework for data-centric AI. We distinguish data-centric AI from related concepts and discuss its longer-term implications for the IS community.