Series of univariate distributions indexed by equally spaced time points are ubiquitous in applications and their analysis constitutes one of the challenges of the emerging field of distributional data analysis. To quantify such distributional time series, we propose a class of intrinsic autoregressive models that operate in the space of optimal transport maps. The autoregressive transport models that we introduce here are based on regressing optimal transport maps on each other, where predictors can be transport maps from an overall barycenter to a current distribution or transport maps between past consecutive distributions of the distributional time series. Autoregressive transport models and their associated distributional regression models specify the link between predictor and response transport maps by moving along geodesics in Wasserstein space. These models emerge as natural extensions of the classical autoregressive models in Euclidean space. Unique stationary solutions of autoregressive transport models are shown to exist under a geometric moment contraction condition of Wu and Shao (2004), using properties of iterated random functions. We also discuss an extension to a varying coefficient model for first order autoregressive transport models. In addition to simulations, the proposed models are illustrated with distributional time series of house prices across U.S. counties and annual summer temperature distributions.
翻译:在等间隔时间点上索引的单变量分布序列在应用中普遍存在,其分析构成了新兴分布数据分析领域的挑战之一。为量化此类分布时间序列,我们提出了一类在最优传输映射空间中运行的内源自回归模型。本文引入的自回归传输模型基于最优传输映射之间的互回归,其中预测变量可以是全局重心到当前分布的传输映射,也可以是分布时间序列中连续分布之间的历史传输映射。自回归传输模型及其相关的分布回归模型通过沿Wasserstein空间中的测地线移动来指定预测变量与响应变量传输映射之间的关联,这些模型可视为欧氏空间中经典自回归模型的自然延伸。利用Wu和Shao(2004)的几何矩收缩条件及迭代随机函数性质,我们证明了自回归传输模型在唯一平稳解存在性上的条件。此外,我们讨论了一阶自回归传输模型的变系数模型扩展。除数值模拟外,本文还通过美国各县房价的分布时间序列及年度夏季气温分布实例对提出模型进行了验证。