Recent advances in 3D reconstruction have paved the way for high-quality, real-time rendering of complex 3D scenes. Despite these achievements, a notable challenge persists: it is difficult to precisely reconstruct specific objects within large scenes. Current scene reconstruction techniques frequently lose fine object textures and cannot reconstruct object portions that are occluded or unseen in the available views. To address this challenge, we study the detailed 3D reconstruction of specific objects within large scenes and propose a framework termed OMEGAS: Object Mesh Extraction from Large Scenes Guided by GAussian Segmentation. OMEGAS employs a multi-step approach built upon several off-the-shelf methods. Specifically, we first utilize the Segment Anything Model (SAM) to guide the segmentation of 3D Gaussian Splatting (3DGS), thereby creating an initial 3DGS model of the target object. We then leverage large-scale diffusion priors to refine the details of the 3DGS model, particularly for object portions that are invisible or occluded in the original scene views. Next, by re-rendering the 3DGS model onto the scene views, we achieve accurate object segmentation and effectively remove the background. Finally, these target-only images are used to further improve the 3DGS model, and the final 3D object mesh is extracted with the SuGaR model. Experiments across various scenarios demonstrate that OMEGAS significantly surpasses existing scene reconstruction methods. Our project page is at: https://github.com/CrystalWlz/OMEGAS