AIFL: A Global Daily Streamflow Forecasting Model Using Deterministic LSTM Pre-trained on ERA5-Land and Fine-tuned on IFS

Maria Luisa Taccari,Kenza Tazi,Oisín M. Morrison,Andreas Grafberger,Juan Colonese,Corentin Carton de Wiart,Christel Prudhomme,Cinzia Mazzetti,Matthew Chantry,Florian Pappenberger

Reliable global streamflow forecasting is essential for flood preparedness and water resource management, yet data-driven models often suffer from a performance gap when transitioning from historical reanalysis to operational forecast products. This paper introduces AIFL (Artificial Intelligence for Floods), a deterministic LSTM-based model designed for global daily streamflow forecasting. Trained on 18,588 basins curated from the CARAVAN dataset, AIFL utilises a novel two-stage training strategy to bridge the reanalysis-to-forecast domain shift. The model is first pre-trained on 40 years of ERA5-Land reanalysis (1980-2019) to capture robust hydrological processes, then fine-tuned on operational Integrated Forecasting System (IFS) control forecasts (2016-2019) to adapt to the specific error structures and biases of operational numerical weather prediction. To our knowledge, this is the first global model trained end-to-end within the CARAVAN ecosystem. On an independent temporal test set (2021-2024), AIFL achieves high predictive skill with a median modified Kling-Gupta Efficiency (KGE') of 0.66 and a median Nash-Sutcliffe Efficiency (NSE) of 0.53. Benchmarking results show that AIFL is highly competitive with current state-of-the-art global systems, achieving comparable accuracy while maintaining a transparent and reproducible forcing pipeline. The model demonstrates exceptional reliability in extreme-event detection, providing a streamlined and operationally robust baseline for the global hydrological community.

翻译：可靠的全球径流预报对于防洪准备和水资源管理至关重要，然而数据驱动模型在从历史再分析过渡到业务预报产品时常常存在性能差距。本文提出了AIFL（面向洪水的人工智能），一种基于确定性LSTM的模型，专为全球日径流预报设计。该模型在CARAVAN数据集中精选的18,588个流域上进行训练，采用了一种新颖的两阶段训练策略以弥合再分析至预报的领域偏移。模型首先在40年的ERA5-Land再分析数据（1980-2019）上进行预训练，以捕捉稳健的水文过程；随后在业务化集成预报系统（IFS）控制预报数据（2016-2019）上进行微调，以适应业务数值天气预报特有的误差结构和偏差。据我们所知，这是在CARAVAN生态系统中首个端到端训练的全球模型。在独立的时间测试集（2021-2024）上，AIFL展现出较高的预测能力，其中位修正克林-古普塔效率（KGE'）达到0.66，中位纳什-萨特克利夫效率（NSE）达到0.53。基准测试结果表明，AIFL与当前最先进的全球系统相比具有高度竞争力，在保持透明且可复现的驱动流程的同时，达到了相当的精度。该模型在极端事件检测中表现出卓越的可靠性，为全球水文学界提供了一个精简且业务稳健的基准。

相关内容

长短期记忆网络

关注 120

长短期记忆网络(LSTM)是一种用于深度学习领域的人工回归神经网络(RNN)结构。与标准的前馈神经网络不同，LSTM具有反馈连接。它不仅可以处理单个数据点(如图像)，还可以处理整个数据序列(如语音或视频)。例如，LSTM适用于未分段、连接的手写识别、语音识别、网络流量或IDSs(入侵检测系统)中的异常检测等任务。

《用于水文建模应用的美国空军全球空陆天气开发模型数据流程：GALWEM采集系统v1.0与v2.0概述》最新报告

专知会员服务

18+阅读 · 2025年12月27日

大型语言模型（LLM）智能体全栈安全的综述：数据、训练与部署

专知会员服务

33+阅读 · 2025年4月23日

面向战场移动威胁的预测模型：利用预测性数据模型打击大规模移动目标

专知会员服务

43+阅读 · 2024年12月23日

大模型如何预测天气？悉尼科技大学等最新《天气和气候数据理解的基础模型》综述

专知会员服务

49+阅读 · 2023年12月9日