Visual Inertial Odometry (VIO) is an essential component of modern Augmented Reality (AR) applications. However, VIO only tracks the relative pose of the device, leading to drift over time. Absolute pose estimation methods infer the device's absolute pose, but their accuracy depends on the input quality. This paper introduces VIO-APR, a new framework for markerless mobile AR that combines an absolute pose regressor (APR) with a local VIO tracking system. VIO-APR uses VIO to assess the reliability of the APR and the APR to identify and compensate for VIO drift. This feedback loop results in more accurate positioning and more stable AR experiences. To evaluate VIO-APR, we created a dataset that combines camera images with ARKit's VIO system output for six indoor and outdoor scenes of various scales. Over this dataset, VIO-APR improves the median accuracy of popular APR by up to 36\% in position and 29\% in orientation, increases the percentage of frames in the high ($0.25 m, 2^{\circ}$) accuracy level by up to 112\% and reduces the percentage of frames predicted below the low ($5 m, 10^\circ$) accuracy greatly. We implement VIO-APR into a mobile AR application using Unity to demonstrate its capabilities. VIO-APR results in noticeably more accurate localization and a more stable overall experience.
翻译:视觉惯性里程计(VIO)是现代增强现实(AR)应用的核心组件。然而,VIO仅能追踪设备的相对位姿,导致随时间累积漂移。绝对位姿估计方法可推断设备的绝对位姿,但其精度取决于输入质量。本文提出VIO-APR——一种结合绝对位姿回归器(APR)与局部VIO追踪系统的无标记移动AR新框架。VIO-APR利用VIO评估APR的可靠性,同时通过APR识别并补偿VIO漂移。这种闭环反馈机制实现了更精准的定位与更稳定的AR体验。为评估VIO-APR,我们构建了涵盖六个不同规模室内外场景的数据集,该数据集融合相机图像与ARKit VIO系统的输出。在此数据集上,VIO-APR将主流APR的中位精度提升最高达36%(位置)与29%(方向),将高精度(0.25米,2°)帧率提升最高达112%,同时显著降低低精度(5米,10°)帧率占比。我们通过Unity将VIO-APR部署至移动AR应用以验证其能力。VIO-APR展现出显著更精准的定位与更稳定的整体体验。