MASSTAR: A Multi-Modal and Large-Scale Scene Dataset with a Versatile Toolchain for Surface Prediction and Completion

Surface prediction and completion have been widely studied in various applications. Recently, research in surface completion has evolved from small objects to complex large-scale scenes. As a result, researchers have begun increasing the volume of data and leveraging a greater variety of data modalities including rendered RGB images, descriptive texts, depth images, etc, to enhance algorithm performance. However, existing datasets suffer from a deficiency in the amounts of scene-level models along with the corresponding multi-modal information. Therefore, a method to scale the datasets and generate multi-modal information in them efficiently is essential. To bridge this research gap, we propose MASSTAR: a Multi-modal lArge-scale Scene dataset with a verSatile Toolchain for surfAce pRediction and completion. We develop a versatile and efficient toolchain for processing the raw 3D data from the environments. It screens out a set of fine-grained scene models and generates the corresponding multi-modal data. Utilizing the toolchain, we then generate an example dataset composed of over a thousand scene-level models with partial real-world data added. We compare MASSTAR with the existing datasets, which validates its superiority: the ability to efficiently extract high-quality models from complex scenarios to expand the dataset. Additionally, several representative surface completion algorithms are benchmarked on MASSTAR, which reveals that existing algorithms can hardly deal with scene-level completion. We will release the source code of our toolchain and the dataset. For more details, please see our project page at https://sysu-star.github.io/MASSTAR.

翻译：表面预测与补全已在多种应用中被广泛研究。近年来，表面补全的研究已从小物体扩展到复杂的大规模场景。为此，研究者开始增加数据规模并利用更多种类的数据模态（包括渲染RGB图像、描述性文本、深度图像等）来提升算法性能。然而，现有数据集在场景级模型数量及其对应的多模态信息方面存在不足。因此，亟需一种能够高效扩展数据集并生成其中多模态信息的方法。为填补这一研究空白，我们提出了MASSTAR：一个面向表面预测与补全的多模态大规模场景数据集与通用工具链。我们开发了一套通用且高效的工具链，用于处理环境中的原始3D数据。该工具链能够筛选出一组精细的场景模型，并生成相应的多模态数据。利用该工具链，我们生成了一个包含上千个场景级模型并融合部分真实世界数据的示例数据集。我们将MASSTAR与现有数据集进行对比，验证了其优越性——能够从复杂场景中高效提取高质量模型以扩展数据集。此外，我们在MASSTAR上对多种代表性表面补全算法进行了基准测试，结果表明现有算法难以应对场景级补全任务。我们将公开工具链的源代码及数据集。更多详情请参见项目页面：https://sysu-star.github.io/MASSTAR。

相关内容

Microsoft Surface

关注 5

Surface 是微软公司（ Microsoft）旗下一系列使用 Windows 10（早期为 Windows 8.X）操作系统的电脑产品，目前有 Surface、Surface Pro 和 Surface Book 三个系列。 2012 年 6 月 18 日，初代 Surface Pro/RT 由时任微软 CEO 史蒂夫·鲍尔默发布于在洛杉矶举行的记者会，2012 年 10 月 26 日上市销售。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日