On the Limits of Prediction: Forecastability Profiles and Information Decay in Time Series

Forecasting accuracy is bounded by the information available about the future. This paper makes that statement precise using information-theoretic tools. Under logarithmic loss, the expected performance of any probabilistic forecast decomposes into two parts: an irreducible component and an approximation component. The irreducible term is the conditional entropy of the future given the available information, while the approximation term is the divergence between the true conditional distribution and the forecasting method. The gap between this conditional-entropy limit and an unconditional baseline is exactly the mutual information between the future observation and the declared information set. This leads to a definition of forecastability as the maximum achievable reduction in expected log loss. Evaluated across horizons, forecastability forms a profile that describes how predictive information varies with lead time. This profile reflects the dependence structure of the process and need not be monotone: predictive information may be concentrated at particular lags, including seasonal horizons, even when intermediate horizons contain little useful signal. From this profile, the paper defines the informative horizon set: the horizons at which forecastability exceeds a practical threshold. At horizons not in this set, the achievable gain over the unconditional baseline is necessarily small, regardless of the forecasting method used. The framework therefore separates what is learnable from what is not, and distinguishes limits imposed by the data from errors introduced by modelling. The result is a pre-modelling diagnostic that identifies where meaningful prediction is feasible before any model is chosen, providing a principled basis for allocating modelling effort across forecast horizons.

翻译：预测精度受限于可获取的未来信息。本文利用信息论工具对该论断进行了精确刻画。在对数损失函数下，任何概率预测的期望性能均可分解为不可约分量与近似分量两部分。不可约项为给定可用信息条件下未来的条件熵，而近似项则是真实条件分布与预测方法之间的散度。该条件熵极限与无条件基线之间的差距恰好等于未来观测值与所声明信息集之间的互信息。由此将可预测性定义为预期对数损失最大可降低幅度。通过跨时间维度的评估，可预测性形成描述预测信息如何随提前期变化的剖面。该剖面反映了过程的依赖结构且无需单调：预测信息可能集中于特定滞后阶数（包括季节性周期），即便中间时段几乎不含有效信号。据此定义信息性时间段集：可预测性超过实际阈值的时间窗口。对于不在此集合中的时段，无论采用何种预测方法，与无条件基线相比的增益必然有限。因此该框架有效区分了可学习内容与不可学习内容，并明确了数据固有约束与建模引入误差之间的界限。其结果为选择模型前的预分析提供了有效诊断工具——在选定模型前即可识别具备实际预测价值的区间，为跨预测时间尺度配置建模资源奠定了理论基础。