Forecasting dynamic scenes remains a fundamental challenge in computer vision, as limited observations make it difficult to capture coherent object-level motion and long-term temporal evolution. We present Motion Group-aware Gaussian Forecasting (MoGaF), a framework for long-term scene extrapolation built upon the 4D Gaussian Splatting representation. MoGaF introduces motion-aware Gaussian grouping and group-wise optimization to enforce physically consistent motion across both rigid and non-rigid regions, yielding spatially coherent dynamic representations. Leveraging this structured space-time representation, a lightweight forecasting module predicts future motion, enabling realistic and temporally stable scene evolution. Experiments on synthetic and real-world datasets demonstrate that MoGaF consistently outperforms existing baselines in rendering quality, motion plausibility, and long-term forecasting stability. Our project page is available at https://slime0519.github.io/mogaf
翻译:动态场景预测仍是计算机视觉中的一项基础性挑战,由于观测数据有限,难以捕捉连贯的物体级运动与长期时间演化。我们提出运动分组感知高斯预测框架(MoGaF),该框架基于4D高斯溅射表示进行长期场景外推。MoGaF通过引入运动感知高斯分组与分组优化策略,在刚性与非刚性区域均能保持物理一致性运动,从而生成空间连贯的动态表征。依托这种结构化的时空表示,一个轻量预测模块可预测未来运动,实现逼真且时间稳定的场景演化。在合成与真实数据集上的实验表明,MoGaF在渲染质量、运动合理性及长期预测稳定性方面持续优于现有基线方法。项目页面:https://slime0519.github.io/mogaf