Human motion prediction (HMP) has emerged as a popular research topic due to its diverse applications, but it remains a challenging task due to the stochastic and aperiodic nature of future poses. Traditional methods rely on hand-crafted features and machine learning techniques, which often struggle to model the complex dynamics of human motion. Recent deep learning-based methods have achieved success by learning spatio-temporal representations of motion, but these models often overlook the reliability of motion data. Additionally, the temporal and spatial dependencies of skeleton nodes are distinct. The temporal relationship captures motion information over time, while the spatial relationship describes body structure and the relationships between different nodes. In this paper, we propose a novel spatio-temporal branching network using incremental information for HMP, which decouples the learning of temporal-domain and spatial-domain features, extracts more motion information, and achieves complementary cross-domain knowledge learning through knowledge distillation. Our approach effectively reduces noise interference and provides more expressive information for characterizing motion by separately extracting temporal and spatial features. We evaluate our approach on standard HMP benchmarks and outperform state-of-the-art methods in terms of prediction accuracy.
翻译:人体运动预测因其广泛的应用而成为热门研究课题,但由于未来姿态的随机性和非周期性,这一任务仍具挑战性。传统方法依赖手工特征和机器学习技术,往往难以建模人体运动的复杂动力学特性。近期基于深度学习的模型通过学习运动的时空表征取得进展,但这些方法常忽略运动数据的可靠性。此外,骨架节点的时空依赖性存在差异:时间关系捕捉运动随时间的变化信息,空间关系则描述身体结构及节点间的关联。本文提出一种基于增量信息的时空分支网络用于人体运动预测,该网络解耦时域与空域特征的学习过程,提取更丰富的运动信息,并通过知识蒸馏实现跨领域互补学习。通过分别提取时空特征,本方法有效降低噪声干扰,为刻画运动提供更具表达力的信息。我们在标准人体运动预测基准上评估该方法,在预测精度上超越了现有最优方法。