While AI-generated text and 2D images continue to expand its territory, 3D generation has gradually emerged as a trend that cannot be ignored. Since the year 2023 an abundant amount of research papers has emerged in the domain of 3D generation. This growth encompasses not just the creation of 3D objects, but also the rapid development of 3D character and motion generation. Several key factors contribute to this progress. The enhanced fidelity in stable diffusion, coupled with control methods that ensure multi-view consistency, and realistic human models like SMPL-X, contribute synergistically to the production of 3D models with remarkable consistency and near-realistic appearances. The advancements in neural network-based 3D storing and rendering models, such as Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS), have accelerated the efficiency and realism of neural rendered models. Furthermore, the multimodality capabilities of large language models have enabled language inputs to transcend into human motion outputs. This paper aims to provide a comprehensive overview and summary of the relevant papers published mostly during the latter half year of 2023. It will begin by discussing the AI generated object models in 3D, followed by the generated 3D human models, and finally, the generated 3D human motions, culminating in a conclusive summary and a vision for the future.
翻译:随着AI生成的文本和二维图像不断拓展其疆域,三维生成技术已逐渐成为不可忽视的趋势。自2023年以来,三维生成领域涌现出大量研究论文。这一增长不仅涵盖三维物体的创建,还包括三维角色与运动生成的快速发展。若干关键因素推动了这一进展:稳定扩散模型保真度的提升、确保多视图一致性的控制方法、以及SMPL-X等逼真人体模型,共同协同生成具有惊人一致性与近乎真实外观的三维模型;基于神经网络的神经辐射场(NeRF)与三维高斯泼溅(3DGS)等三维存储与渲染模型的进步,加速了神经渲染模型的效率与逼真度;此外,大语言模型的多模态能力使语言输入能够跨越至人体运动输出。本文旨在对2023年下半年发表的相关论文进行全面的概述与总结,将首先讨论AI生成的三维物体模型,继而探讨生成的三维人体模型,最后聚焦生成的三维人体运动,并最终以结论性总结与未来展望收尾。