The Knowable Future: Mapping the Decay of Past-Future Mutual Information Across Forecast Horizons

Forecasting fails not because models are weak, but because effort is wasted on series whose futures are fundamentally unknowable. We propose a pre-modelling diagnostic framework that assesses horizon-specific forecastability before model selection begins, enabling practitioners to allocate effort where it can yield returns. We operationalise forecastability as auto-mutual information (AMI) at lag h, measuring the reduction in uncertainty about future values provided by the past. Using a k-nearest-neighbour estimator on training data only, we validate AMI against realised out-of-sample error (sMAPE) across 1,350 M4 series spanning six frequencies, with Seasonal Naive, ETS, and N-BEATS as probe models under a rolling-origin protocol. The central finding is that the AMI-sMAPE relationship is strongly frequency-conditional. For Weekly, Hourly, Quarterly, and Yearly series, AMI exhibits consistent negative rank association with realised error (p ranging from -0.52 to -0.66 for higher-capacity probes), supporting its use as a triage signal. Monthly shows moderate association; Daily exhibits weaker discrimination despite measurable dependence. Across all frequencies, median forecast error decreases monotonically from low to high AMI terciles, confirming decision-relevant separation. These results establish AMI as a practical screening tool for forecasting portfolios: identifying series where sophisticated modelling is warranted, where baselines suffice, and where effort should shift from accuracy improvement to robust decision design.

翻译：预测失败并非因为模型能力不足，而是因为努力被浪费在了那些未来本质上不可知的序列上。我们提出一种建模前诊断框架，可在模型选择开始前评估特定时域的预测可行性，使实践者能将努力分配到可能产生回报的领域。我们将预测可行性操作化为滞后h处的自互信息（AMI），用以衡量过去信息对未来值不确定性的减少程度。仅使用训练数据并采用k近邻估计器，我们在涵盖六种频率的1,350条M4序列上，以季节性朴素法、ETS和N-BEATS作为探针模型，在滚动原点协议下验证了AMI与已实现样本外误差（sMAPE）的关系。核心发现是：AMI-sMAPE关系具有强烈的频率依赖性。对于周度、小时、季度和年度序列，AMI与已实现误差呈现一致的负秩相关（高容量探针的p值范围从-0.52至-0.66），支持其作为分流信号的适用性。月度序列呈现中等相关性；日度序列尽管存在可测量的依赖性，但区分能力较弱。在所有频率中，中位预测误差从低到高AMI三分位数组呈现单调递减，证实了具有决策意义的区分度。这些结果确立了AMI作为预测组合的实用筛选工具：可识别哪些序列需要复杂建模、哪些序列基线方法已足够，以及哪些序列应将努力方向从精度提升转向稳健决策设计。