Visual place recognition (VPR) enables autonomous systems to localize themselves within an environment using image information. While VPR techniques built upon a Convolutional Neural Network (CNN) backbone dominate state-of-the-art VPR performance, their high computational requirements make them unsuitable for platforms equipped with low-end hardware. Recently, a lightweight VPR system based on multiple bio-inspired classifiers, dubbed DrosoNets, has been proposed, achieving great computational efficiency at the cost of reduced absolute place retrieval performance. In this work, we propose a novel multi-DrosoNet localization system, dubbed RegionDrosoNet, with significantly improved VPR performance, while preserving a low-computational profile. Our approach relies on specializing distinct groups of DrosoNets on differently sliced partitions of the original image, increasing extrinsic model differentiation. Furthermore, we introduce a novel voting module to combine the outputs of all DrosoNets into the final place prediction which considers multiple top refence candidates from each DrosoNet. RegionDrosoNet outperforms other lightweight VPR techniques when dealing with both appearance changes and viewpoint variations. Moreover, it competes with computationally expensive methods on some benchmark datasets at a small fraction of their online inference time.
翻译:视觉位置识别(VPR)使自主系统能够利用图像信息在环境中进行定位。尽管以卷积神经网络(CNN)为主干的VPR技术主导了当前最优性能,但其高计算需求使其不适用于搭载低端硬件的平台。近期,一种基于多生物启发分类器的轻量化VPR系统——DrosoNets——被提出,该方案在实现极高计算效率的同时,以降低绝对位置检索性能为代价。本文提出一种新颖的多DrosoNet定位系统——RegionDrosoNet,在保持低计算特性的前提下显著提升了VPR性能。我们的方法通过将原始图像划分为不同切片区域,专门训练不同的DrosoNet分组,从而增强模型间差异化。此外,我们引入新型投票模块来融合所有DrosoNet的输出以生成最终位置预测,该模块可综合每个DrosoNet的多个最优候选参考结果。在应对外观变化和视角变化时,RegionDrosoNet不仅优于其他轻量化VPR技术,还能在部分基准数据集上与高计算成本方法竞争,而其在线推理时间仅为这些方法的极小部分。