ChatGPT and its improved variant GPT-4 have revolutionized the NLP field, with a single model solving almost all text-related tasks. However, no such model exists for computer vision, especially for 3D vision. This article first gives a brief overview of the progress of deep learning in the text, image, and 3D fields from the model perspective. It then discusses how AIGC has evolved from the data perspective. Finally, it presents an outlook on the development of AIGC in 3D, again from the data perspective.