GDTS：基于目标引导扩散模型与树采样的多模态行人轨迹预测 (GDTS: Goal-Guided Diffusion Model with Tree Sampling for Multi-Modal Pedestrian Trajectory Prediction)

Accurate prediction of pedestrian trajectories is crucial for improving the safety of autonomous driving. However, this task is generally nontrivial due to the inherent stochasticity of human motion, which naturally requires the predictor to generate multi-modal prediction. Previous works leverage various generative methods, such as GAN and VAE, for pedestrian trajectory prediction. Nevertheless, these methods may suffer from mode collapse and relatively low-quality results. The denoising diffusion probabilistic model (DDPM) has recently been applied to trajectory prediction due to its simple training process and powerful reconstruction ability. However, current diffusion-based methods do not fully utilize input information and usually require many denoising iterations that lead to a long inference time or an additional network for initialization. To address these challenges and facilitate the use of diffusion models in multi-modal trajectory prediction, we propose GDTS, a novel Goal-Guided Diffusion Model with Tree Sampling for multi-modal trajectory prediction. Considering the "goal-driven" characteristics of human motion, GDTS leverages goal estimation to guide the generation of the diffusion network. A two-stage tree sampling algorithm is presented, which leverages common features to reduce the inference time and improve accuracy for multi-modal prediction. Experimental results demonstrate that our proposed framework achieves comparable state-of-the-art performance with real-time inference speed in public datasets.

翻译：行人轨迹的精确预测对于提升自动驾驶安全性至关重要。然而，由于人类运动固有的随机性，该任务通常具有挑战性，这自然要求预测器能够生成多模态预测。先前的研究利用多种生成方法（如GAN和VAE）进行行人轨迹预测。然而，这些方法可能面临模式崩溃和生成质量相对较低的问题。去噪扩散概率模型（DDPM）因其简单的训练过程和强大的重建能力，近期被应用于轨迹预测领域。然而，当前基于扩散的方法未能充分利用输入信息，且通常需要大量去噪迭代，导致推理时间较长或需要额外网络进行初始化。为应对这些挑战并促进扩散模型在多模态轨迹预测中的应用，本文提出GDTS——一种用于多模态轨迹预测的新型目标引导扩散模型与树采样方法。考虑到人类运动的“目标驱动”特性，GDTS利用目标估计来引导扩散网络的生成。我们提出了一种两阶段树采样算法，该算法通过利用共同特征来减少推理时间并提升多模态预测的准确性。实验结果表明，所提出的框架在公开数据集上实现了与先进方法相当的性能，并具备实时推理速度。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日