To enhance perception performance in complex and extensive scenarios within the realm of autonomous driving, there has been a noteworthy focus on temporal modeling, with a particular emphasis on streaming methods. The prevailing trend in streaming models involves the utilization of stream queries for the propagation of temporal information. Despite the prevalence of this approach, the direct application of the streaming paradigm to the construction of vectorized high-definition maps (HD-maps) fails to fully harness the inherent potential of temporal information. This paper introduces the Stream Query Denoising (SQD) strategy as a novel approach for temporal modeling in high-definition map (HD-map) construction. SQD is designed to facilitate the learning of temporal consistency among map elements within the streaming model. The methodology involves denoising the queries that have been perturbed by the addition of noise to the ground-truth information from the preceding frame. This denoising process aims to reconstruct the ground-truth information for the current frame, thereby simulating the prediction process inherent in stream queries. The SQD strategy can be applied to those streaming methods (e.g., StreamMapNet) to enhance the temporal modeling. The proposed SQD-MapNet is the StreamMapNet equipped with SQD. Extensive experiments on nuScenes and Argoverse2 show that our method is remarkably superior to other existing methods across all settings of close range and long range. The code will be available soon.
翻译:为了在自动驾驶领域的复杂和大规模场景中提升感知性能,时间建模引起了显著关注,尤其是流式方法。流式模型的主流趋势是利用流式查询来传播时间信息。尽管该方法普遍存在,但将流式范式直接应用于矢量化高清地图(HD-maps)的构建未能充分利用时间信息的固有潜力。本文提出了流式查询去噪(SQD)策略,作为一种用于高清地图(HD-map)构建中时间建模的新方法。SQD旨在促进流式模型中地图元素间时间一致性的学习。该方法涉及对通过向前一帧真实值信息添加噪声而扰动的查询进行去噪。该去噪过程旨在重建当前帧的真实值信息,从而模拟流式查询中的预测过程。SQD策略可应用于那些流式方法(如StreamMapNet)以增强时间建模。所提出的SQD-MapNet是配备SQD的StreamMapNet版本。在nuScenes和Argoverse2上的大量实验表明,我们的方法在近距和远距的所有设置下均显著优于现有其他方法。代码将很快公开。