The advancement of visual intelligence is intrinsically tethered to the availability of data. In parallel, generative Artificial Intelligence (AI) has unlocked the potential to create synthetic images that closely resemble real-world photographs, which prompts a compelling inquiry: how visual intelligence benefit from the advance of generative AI? This paper explores the innovative concept of harnessing these AI-generated images as a new data source, reshaping traditional model paradigms in visual intelligence. In contrast to real data, AI-generated data sources exhibit remarkable advantages, including unmatched abundance and scalability, the rapid generation of vast datasets, and the effortless simulation of edge cases. Built on the success of generative AI models, we examines the potential of their generated data in a range of applications, from training machine learning models to simulating scenarios for computational modelling, testing, and validation. We probe the technological foundations that support this groundbreaking use of generative AI, engaging in an in-depth discussion on the ethical, legal, and practical considerations that accompany this transformative paradigm shift. Through an exhaustive survey of current technologies and applications, this paper presents a comprehensive view of the synthetic era in visual intelligence. A project with this paper can be found at https://github.com/mwxely/AIGS .
翻译:视觉智能的进步本质上依赖于数据的可用性。与此同时,生成式人工智能(AI)开辟了创造与真实照片高度相似的合成图像的潜力,这引发了一个引人深思的问题:视觉智能如何从生成式AI的进步中获益?本文探讨了利用这些AI生成图像作为新型数据源的创新概念,重塑了视觉智能中传统的模型范式。与真实数据相比,AI生成的数据源展现出显著优势,包括无与伦比的丰富性和可扩展性、海量数据集的快速生成,以及边缘案例的轻松模拟。基于生成式AI模型的成功,我们审视了其生成数据在从训练机器学习模型到为计算建模、测试和验证模拟场景等一系列应用中的潜力。我们探究了支持这一突破性生成式AI应用的技术基础,并就伴随这一变革性范式转变的伦理、法律和实践考量进行了深入讨论。通过对当前技术和应用的详尽调查,本文呈现了视觉智能合成纪元的全面图景。本文相关项目可访问 https://github.com/mwxely/AIGS 查看。