SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking

3D single object tracking (SOT) is an important and challenging task for the autonomous driving and mobile robotics. Most existing methods perform tracking between two consecutive frames while ignoring the motion patterns of the target over a series of frames, which would cause performance degradation in the scenes with sparse points. To break through this limitation, we introduce Sequence-to-Sequence tracking paradigm and a tracker named SeqTrack3D to capture target motion across continuous frames. Unlike previous methods that primarily adopted three strategies: matching two consecutive point clouds, predicting relative motion, or utilizing sequential point clouds to address feature degradation, our SeqTrack3D combines both historical point clouds and bounding box sequences. This novel method ensures robust tracking by leveraging location priors from historical boxes, even in scenes with sparse points. Extensive experiments conducted on large-scale datasets show that SeqTrack3D achieves new state-of-the-art performances, improving by 6.00% on NuScenes and 14.13% on Waymo dataset. The code will be made public at https://github.com/aron-lin/seqtrack3d.

翻译：三维单目标跟踪（SOT）是自动驾驶和移动机器人领域中一项重要且具有挑战性的任务。现有大多数方法仅基于连续两帧之间进行跟踪，忽略了目标在一系列帧中的运动模式，这会在点云稀疏的场景中导致性能下降。为突破这一局限，我们引入了序列到序列（Sequence-to-Sequence）跟踪范式，并提出了名为SeqTrack3D的跟踪器，以捕捉目标在连续帧间的运动。与先前主要采用三种策略（匹配两帧连续点云、预测相对运动、或利用序列点云解决特征退化）的方法不同，我们的SeqTrack3D结合了历史点云与边界框序列。这种新颖方法能够利用历史边界框的位置先验信息，即使在点云稀疏的场景中也能实现鲁棒跟踪。在大规模数据集上的大量实验表明，SeqTrack3D达到了新的最先进性能，在NuScenes数据集上提升6.00%，在Waymo数据集上提升14.13%。代码将开源在https://github.com/aron-lin/seqtrack3d。

相关内容

点云

关注 0

根据激光测量原理得到的点云，包括三维坐标（XYZ）和激光反射强度（Intensity）。根据摄影测量原理得到的点云，包括三维坐标（XYZ）和颜色信息（RGB）。结合激光测量和摄影测量原理得到点云，包括三维坐标（XYZ）、激光反射强度（Intensity）和颜色信息（RGB）。在获取物体表面每个采样点的空间坐标后，得到的是一个点的集合，称之为“点云”(Point Cloud)

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日