In this paper, we develop a modular neural network for real-time {\color{black}(> 10 Hz)} semantic mapping in uncertain environments, which explicitly updates per-voxel probabilistic distributions within a neural network layer. Our approach combines the reliability of classical probabilistic algorithms with the performance and efficiency of modern neural networks. Although robotic perception is often divided between modern differentiable methods and classical explicit methods, a union of both is necessary for real-time and trustworthy performance. We introduce a novel Convolutional Bayesian Kernel Inference (ConvBKI) layer which incorporates semantic segmentation predictions online into a 3D map through a depthwise convolution layer by leveraging conjugate priors. We compare ConvBKI against state-of-the-art deep learning approaches and probabilistic algorithms for mapping to evaluate reliability and performance. We also create a Robot Operating System (ROS) package of ConvBKI and test it on real-world perceptually challenging off-road driving data.
翻译:本文提出了一种模块化神经网络,用于在不确定环境中进行实时(>10 Hz)语义建图,该网络在神经网络层内显式更新每个体素的概率分布。我们的方法将经典概率算法的可靠性与现代神经网络的性能和效率相结合。尽管机器人感知通常被划分为现代可微分方法和经典显式方法,但二者的结合对于实现实时且可信的性能至关重要。我们提出了一种新颖的卷积贝叶斯核推理(ConvBKI)层,该层通过利用共轭先验,通过深度卷积层将在线语义分割预测整合到三维地图中。我们将ConvBKI与最先进的深度学习方法及概率建图算法进行比较,以评估其可靠性和性能。我们还创建了ConvBKI的机器人操作系统(ROS)软件包,并在真实世界具有感知挑战性的越野驾驶数据上进行了测试。