GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality

Recently, 3D Gaussian splatting (3D-GS) has achieved great success in reconstructing and rendering real-world scenes. To transfer this high rendering quality to generation tasks, a series of works has attempted to generate 3D Gaussian assets from text. However, the generated assets have not reached the quality achieved in reconstruction tasks. We observe that Gaussians tend to grow without control, as the generation process may introduce indeterminacy. To substantially improve generation quality, we propose a novel framework named GaussianDreamerPro. The main idea is to bind Gaussians to reasonable geometry, which evolves over the whole generation process. Across the stages of our framework, both geometry and appearance are enriched progressively. The final output asset is constructed with 3D Gaussians bound to a mesh, and shows significantly enhanced detail and quality compared with previous methods. Notably, the generated asset can also be seamlessly integrated into downstream manipulation pipelines, e.g., animation, composition, and simulation, which greatly broadens its range of applications. Demos are available at https://taoranyi.com/gaussiandreamerpro/.
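
The idea of binding Gaussians to a mesh can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how Gaussians are attached to the geometry, so the barycentric parameterization, the `BoundGaussian` class, and the `gaussian_center` function below are all hypothetical names and assumptions chosen for illustration. The sketch only shows why a mesh-bound Gaussian follows the mesh when the mesh is moved or deformed, which is the property that makes such assets convenient for animation or simulation.

```python
from dataclasses import dataclass
from math import sqrt
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]


@dataclass
class BoundGaussian:
    """Hypothetical record tying one Gaussian's center to one mesh triangle."""
    face: int                          # index of the triangle the Gaussian is bound to
    bary: Tuple[float, float, float]   # barycentric weights on that triangle (sum to 1)
    offset: float = 0.0                # signed displacement along the triangle normal


def unit_normal(a: Vec3, b: Vec3, c: Vec3) -> Vec3:
    """Unit normal of triangle (a, b, c) from the cross product of two edges."""
    u = (b[0] - a[0], b[1] - a[1], b[2] - a[2])
    v = (c[0] - a[0], c[1] - a[1], c[2] - a[2])
    n = (u[1] * v[2] - u[2] * v[1],
         u[2] * v[0] - u[0] * v[2],
         u[0] * v[1] - u[1] * v[0])
    length = sqrt(n[0] ** 2 + n[1] ** 2 + n[2] ** 2)
    if length == 0.0:
        raise ValueError("degenerate triangle has no normal")
    return (n[0] / length, n[1] / length, n[2] / length)


def gaussian_center(g: BoundGaussian,
                    vertices: Sequence[Vec3],
                    faces: Sequence[Tuple[int, int, int]]) -> Vec3:
    """World-space center of a bound Gaussian for the current vertex positions."""
    i, j, k = faces[g.face]
    a, b, c = vertices[i], vertices[j], vertices[k]
    w0, w1, w2 = g.bary
    n = unit_normal(a, b, c)
    # Barycentric point on the triangle, pushed along the normal by `offset`.
    return (w0 * a[0] + w1 * b[0] + w2 * c[0] + g.offset * n[0],
            w0 * a[1] + w1 * b[1] + w2 * c[1] + g.offset * n[1],
            w0 * a[2] + w1 * b[2] + w2 * c[2] + g.offset * n[2])


if __name__ == "__main__":
    vertices = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
    faces = [(0, 1, 2)]
    g = BoundGaussian(face=0, bary=(1 / 3, 1 / 3, 1 / 3), offset=0.1)

    print("rest pose center:", gaussian_center(g, vertices, faces))

    # Move the mesh; the Gaussian follows without its own parameters changing.
    lifted = [(x, y, z + 2.0) for (x, y, z) in vertices]
    print("moved mesh center:", gaussian_center(g, lifted, faces))
```

Because the Gaussian stores only a face index, barycentric weights, and a normal offset, editing the mesh vertices is enough to move every bound Gaussian consistently. A full system would also need to carry rotation and scale along with the triangle, which this sketch leaves out.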

