The matching of 3D shapes has been extensively studied for shapes represented as surface meshes, as well as for shapes represented as point clouds. While point clouds are a common representation of raw real-world 3D data (e.g. from laser scanners), meshes encode rich and expressive topological information, but their creation typically requires some form of (often manual) curation. In turn, methods that purely rely on point clouds are unable to meet the matching quality of mesh-based methods that utilise the additional topological structure. In this work we close this gap by introducing a self-supervised multimodal learning strategy that combines mesh-based functional map regularisation with a contrastive loss that couples mesh and point cloud data. Our shape matching approach allows to obtain intramodal correspondences for triangle meshes, complete point clouds, and partially observed point clouds, as well as correspondences across these data modalities. We demonstrate that our method achieves state-of-the-art results on several challenging benchmark datasets even in comparison to recent supervised methods, and that our method reaches previously unseen cross-dataset generalisation ability.
翻译:3D形状的匹配问题已在表示为表面网格和点云的形状上得到广泛研究。点云是原始真实世界3D数据(如来自激光扫描仪)的常见表示形式,而网格则编码了丰富且富有表现力的拓扑信息,但其创建通常需要某种形式的(通常为人工)处理。因此,纯粹依赖点云的方法无法达到利用额外拓扑结构的基于网格方法的匹配质量。在本工作中,我们通过引入一种自监督多模态学习策略来弥合这一差距,该策略将基于网格的函数映射正则化与耦合网格和点云数据的对比损失相结合。我们的形状匹配方法能够获得三角形网格、完整点云和部分观测点云的模态内对应关系,以及跨这些数据模态的对应关系。我们证明,即使与最近的监督方法相比,我们的方法在多个具有挑战性的基准数据集上取得了最先进的结果,并且我们的方法达到了前所未有的跨数据集泛化能力。