AVM-SLAM: Semantic Visual SLAM with Multi-Sensor Fusion in a Bird's Eye View for Automated Valet Parking

Automated Valet Parking (AVP) requires precise localization in challenging garage conditions, including poor lighting, sparse textures, repetitive structures, dynamic scenes, and the absence of Global Positioning System (GPS) signals, which often pose problems for conventional localization methods. To address these adversities, we present AVM-SLAM, a semantic visual SLAM framework with multi-sensor fusion in a Bird's Eye View (BEV). Our framework integrates four fisheye cameras, four wheel encoders, and an Inertial Measurement Unit (IMU). The fisheye cameras form an Around View Monitor (AVM) subsystem, generating BEV images. Convolutional Neural Networks (CNNs) extract semantic features from these images, aiding in mapping and localization tasks. These semantic features provide long-term stability and perspective invariance, effectively mitigating environmental challenges. Additionally, data fusion from wheel encoders and IMU enhances system robustness by improving motion estimation and reducing drift. To validate AVM-SLAM's efficacy and robustness, we provide a large-scale, high-resolution underground garage dataset, available at https://github.com/yale-cv/avm-slam. This dataset enables researchers to further explore and assess AVM-SLAM in similar environments.

翻译：自动代客泊车（AVP）需要在具有挑战性的车库环境中实现精确定位，包括光照不良、纹理稀疏、结构重复、动态场景以及全球定位系统（GPS）信号缺失等情况——这些因素通常会对传统定位方法造成困扰。为应对上述不利条件，我们提出AVM-SLAM，一种基于鸟瞰视图（BEV）的多传感器融合语义视觉SLAM框架。该框架集成了四个鱼眼相机、四个轮式编码器以及一个惯性测量单元（IMU）。鱼眼相机构成环视监控（AVM）子系统，生成BEV图像。卷积神经网络（CNN）从这些图像中提取语义特征，用于辅助建图与定位任务。这些语义特征具备长期稳定性和视角不变性，可有效缓解环境挑战。此外，轮式编码器与IMU的数据融合通过改善运动估计并减少漂移，增强了系统鲁棒性。为验证AVM-SLAM的有效性与鲁棒性，我们提供了大规模高分辨率地下车库数据集（获取地址：https://github.com/yale-cv/avm-slam），该数据集能够支持研究者在类似环境中对AVM-SLAM进行深入探索与评估。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日