Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations

The perception of motion behavior in a dynamic environment holds significant importance for autonomous driving systems, wherein class-agnostic motion prediction methods directly predict the motion of the entire point cloud. While most existing methods rely on fully-supervised learning, the manual labeling of point cloud data is laborious and time-consuming. Therefore, several annotation-efficient methods have been proposed to address this challenge. Although effective, these methods rely on weak annotations or additional multi-modal data like images, and the potential benefits inherent in the point cloud sequence are still underexplored. To this end, we explore the feasibility of self-supervised motion prediction with only unlabeled LiDAR point clouds. Initially, we employ an optimal transport solver to establish coarse correspondences between current and future point clouds as the coarse pseudo motion labels. Training models directly using such coarse labels leads to noticeable spatial and temporal prediction inconsistencies. To mitigate these issues, we introduce three simple spatial and temporal regularization losses, which facilitate the self-supervised training process effectively. Experimental results demonstrate the significant superiority of our approach over the state-of-the-art self-supervised methods.

翻译：动态环境下的运动行为感知对于自动驾驶系统至关重要，其中类无关运动预测方法可直接预测整个点云的运动。现有方法大多依赖全监督学习，但点云数据的人工标注费时费力。因此，已有多项标注高效方法被提出以应对这一挑战。尽管这些方法有效，但它们依赖弱标注或图像等额外多模态数据，且点云序列中蕴含的潜在优势尚未被充分挖掘。为此，我们探索仅利用无标注激光雷达点云进行自监督运动预测的可行性。首先，采用最优传输求解器建立当前与未来点云之间的粗略对应关系作为粗粒度伪运动标签。直接使用此类粗标签训练模型会导致显著的时空预测不一致性。为缓解该问题，我们引入三种简单的时空正则化损失函数，有效促进了自监督训练过程。实验结果表明，我们的方法显著优于现有最优的自监督方法。

相关内容

点云

关注 50

根据激光测量原理得到的点云，包括三维坐标（XYZ）和激光反射强度（Intensity）。根据摄影测量原理得到的点云，包括三维坐标（XYZ）和颜色信息（RGB）。结合激光测量和摄影测量原理得到点云，包括三维坐标（XYZ）、激光反射强度（Intensity）和颜色信息（RGB）。在获取物体表面每个采样点的空间坐标后，得到的是一个点的集合，称之为“点云”(Point Cloud)

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日