Spatio-temporal forecasting is fundamental to intelligent systems in transportation, climate science, and urban planning. However, training deep learning models on the massive, often redundant, datasets from these domains presents a significant computational bottleneck. Existing solutions typically focus on optimizing model architectures or optimizers, while overlooking the inherent inefficiency of the training data itself. This conventional approach of iterating over the entire static dataset each epoch wastes considerable resources on easy-to-learn or repetitive samples. In this paper, we explore a novel training-efficiency techniques, namely learning from complexity with dynamic sample pruning, ST-Prune, for spatio-temporal forecasting. Through dynamic sample pruning, we aim to intelligently identify the most informative samples based on the model's real-time learning state, thereby accelerating convergence and improving training efficiency. Extensive experiments conducted on real-world spatio-temporal datasets show that ST-Prune significantly accelerates the training speed while maintaining or even improving the model performance, and it also has scalability and universality.
翻译:时空预测是交通、气候科学和城市规划等领域智能系统的基础。然而,在这些领域海量且通常冗余的数据集上训练深度学习模型,构成了显著的计算瓶颈。现有解决方案通常侧重于优化模型架构或优化器,却忽视了训练数据本身固有的低效性。这种在每个训练周期迭代整个静态数据集的传统方法,在易于学习或重复的样本上浪费了大量资源。本文探索了一种新颖的训练效率技术,即基于复杂性的动态样本剪枝学习——ST-Prune,用于时空预测。通过动态样本剪枝,我们旨在根据模型的实时学习状态智能识别信息量最大的样本,从而加速收敛并提高训练效率。在真实世界时空数据集上进行的大量实验表明,ST-Prune在保持甚至提升模型性能的同时,显著加快了训练速度,并且具备良好的可扩展性和普适性。