This paper presents a robust monocular visual SLAM system that simultaneously utilizes point, line, and vanishing point features for accurate camera pose estimation and mapping. To address the critical challenge of achieving reliable localization in low-texture environments, where traditional point-based systems often fail due to insufficient visual features, we introduce a novel approach leveraging Global Primitives structural information to improve the system's robustness and accuracy performance. Our key innovation lies in constructing vanishing points from line features and proposing a weighted fusion strategy to build Global Primitives in the world coordinate system. This strategy associates multiple frames with non-overlapping regions and formulates a multi-frame reprojection error optimization, significantly improving tracking accuracy in texture-scarce scenarios. Evaluations on various datasets show that our system outperforms state-of-the-art methods in trajectory precision, particularly in challenging environments.
翻译:本文提出了一种鲁棒的单目视觉SLAM系统,该系统同时利用点、线和消失点特征来实现精确的相机位姿估计与地图构建。针对低纹理环境中传统基于点的系统常因视觉特征不足而失效、导致定位可靠性不足这一关键挑战,我们引入了一种利用全局基元结构信息的新方法,以提升系统的鲁棒性与精度性能。我们的核心创新在于从线特征构建消失点,并提出一种加权融合策略以在世界坐标系中建立全局基元。该策略将多个具有非重叠区域的帧关联起来,并构建了多帧重投影误差优化模型,从而显著提升了在纹理匮乏场景下的跟踪精度。在多个数据集上的评估表明,本系统在轨迹精度方面优于现有先进方法,尤其在具有挑战性的环境中表现更为突出。