一种基于视觉信标机遇性利用与航位推算的GNSS拒止大规模应用视觉-惯性定位算法 (A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications)

With the development of smart cities, the demand for continuous pedestrian navigation in large-scale urban environments has significantly increased. While global navigation satellite systems (GNSS) provide low-cost and reliable positioning services, they are often hindered in complex urban canyon environments. Thus, exploring opportunistic signals for positioning in urban areas has become a key solution. Augmented reality (AR) allows pedestrians to acquire real-time visual information. Accordingly, we propose a low-cost visual-inertial positioning solution. This method comprises a lightweight multi-scale group convolution (MSGC)-based visual place recognition (VPR) neural network, a pedestrian dead reckoning (PDR) algorithm, and a visual/inertial fusion approach based on a Kalman filter with gross error suppression. The VPR serves as a conditional observation to the Kalman filter, effectively correcting the errors accumulated through the PDR method. This enables the entire algorithm to ensure the reliability of long-term positioning in GNSS-denied areas. Extensive experimental results demonstrate that our method maintains stable positioning during large-scale movements. Compared to the lightweight MobileNetV3-based VPR method, our proposed VPR solution improves Recall@1 by at least 3\% on two public datasets while reducing the number of parameters by 63.37\%. It also achieves performance that is comparable to the VGG16-based method. The VPR-PDR algorithm improves localization accuracy by more than 40\% compared to the original PDR.

翻译：随着智慧城市的发展，大规模城市环境中连续行人导航的需求显著增加。全球导航卫星系统（GNSS）虽能提供低成本且可靠的定位服务，但在复杂的城市峡谷环境中常受阻碍。因此，探索城市区域中基于机遇性信号的定位已成为关键解决方案。增强现实（AR）技术使行人能够获取实时视觉信息。据此，我们提出一种低成本视觉-惯性定位方案。该方法包含一个基于轻量化多尺度组卷积（MSGC）的视觉位置识别（VPR）神经网络、一个行人航位推算（PDR）算法，以及一种基于带粗差抑制的卡尔曼滤波器的视觉/惯性融合方法。VPR作为卡尔曼滤波器的条件观测值，有效修正了通过PDR方法累积的误差，从而使整个算法能够确保在GNSS拒止区域中长期定位的可靠性。大量实验结果表明，我们的方法在大范围移动过程中能保持稳定的定位性能。与基于轻量化MobileNetV3的VPR方法相比，我们提出的VPR方案在两个公开数据集上将Recall@1指标提升了至少3%，同时参数量减少了63.37%，其性能亦达到与基于VGG16的方法相当的水平。相较于原始PDR，VPR-PDR算法将定位精度提升了40%以上。

相关内容

声纹识别

关注 444

说话人识别（Speaker Recognition），或者称为声纹识别（Voiceprint Recognition, VPR），是根据语音中所包含的说话人个性信息，利用计算机以及现在的信息识别技术，自动鉴别说话人身份的一种生物特征识别技术。说话人识别研究的目的就是从语音中提取具有说话人表征性的特征，建立有效的模型和系统，实现自动精准的说话人鉴别。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

31+阅读 · 2021年9月29日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日