3D Gaussian Splatting (GS) have achieved considerable improvement over Neural Radiance Fields in terms of 3D fitting fidelity and rendering speed. However, this unstructured representation with scattered Gaussians poses a significant challenge for generative modeling. To address the problem, we introduce GaussianCube, a structured GS representation that is both powerful and efficient for generative modeling. We achieve this by first proposing a modified densification-constrained GS fitting algorithm which can yield high-quality fitting results using a fixed number of free Gaussians, and then re-arranging the Gaussians into a predefined voxel grid via Optimal Transport. The structured grid representation allows us to use standard 3D U-Net as our backbone in diffusion generative modeling without elaborate designs. Extensive experiments conducted on ShapeNet and OmniObject3D show that our model achieves state-of-the-art generation results both qualitatively and quantitatively, underscoring the potential of GaussianCube as a powerful and versatile 3D representation.
翻译:三维高斯泼溅(GS)在三维拟合保真度和渲染速度方面相比神经辐射场取得了显著提升。然而,这种包含离散高斯的非结构化表示为生成建模带来了重大挑战。为解决该问题,我们提出GaussianCube,这是一种兼具强大性能与高效性的结构化GS表示方法。我们首先提出一种改进的致密化约束GS拟合算法,该算法能够通过固定数量的自由高斯获得高质量拟合结果;随后通过最优传输将高斯重新排列至预定义的体素网格中。这种结构化网格表示使我们能够使用标准三维U-Net作为扩散生成建模中的骨干网络,而无需复杂的设计。在ShapeNet和OmniObject3D上的大量实验表明,我们的模型在定性和定量指标上均达到最优生成效果,这凸显了GaussianCube作为强大且通用的三维表示方法的潜力。