We present NeuSE, a novel Neural SE(3)-Equivariant Embedding for objects, and illustrate how it supports object SLAM for consistent spatial understanding with long-term scene changes. NeuSE is a set of latent object embeddings created from partial object observations. It serves as a compact point cloud surrogate for complete object models, encoding full shape information while transforming SE(3)-equivariantly in tandem with the object in the physical world. With NeuSE, relative frame transforms can be directly derived from inferred latent codes. Our proposed SLAM paradigm, using NeuSE for object shape and pose characterization, can operate independently or in conjunction with typical SLAM systems. It directly infers SE(3) camera pose constraints that are compatible with general SLAM pose graph optimization, while also maintaining a lightweight object-centric map that adapts to real-world changes. Our approach is evaluated on synthetic and real-world sequences featuring changed objects and shows improved localization accuracy and change-aware mapping capability, when working either standalone or jointly with a common SLAM pipeline.
翻译:我们提出NeuSE——一种新颖的神经SE(3)等变物体嵌入方法,并阐释其如何支持物体SLAM以实现长期场景变化下的一致性空间理解。NeuSE由局部物体观测生成的潜在物体嵌入集合构成,它作为完整物体模型的紧凑点云替代,在编码完整形状信息的同时,与物理世界中的物体同步进行SE(3)等变变换。通过NeuSE,可直接从推断的潜在编码中导出相对帧变换。我们提出的SLAM范式利用NeuSE进行物体形状与位姿表征,既可独立运行,也可与典型SLAM系统协同工作。该方法能直接推断与通用SLAM位姿图优化兼容的SE(3)相机位姿约束,同时维护适应现实世界变化的轻量级物体中心地图。我们在包含物体变化的合成与真实世界序列上评估了该方法,结果表明,当独立运行或与常规SLAM管道联合使用时,该方法均展现出更优的定位精度与感知变化的地图构建能力。