Machine learning models have been employed to perform either physics-free data-driven or hybrid dynamical downscaling of climate data. Most of these implementations operate over relatively small downscaling factors because of the challenge of recovering fine-scale information from coarse data. This limits their compatibility with many global climate model outputs, often available between $\sim$50--100 km resolution, to scales of interest such as cloud resolving or urban scales. This study systematically examines the capability of convolutional neural networks (CNNs) to downscale surface wind speed data over land surface from different coarse resolutions (25 km, 48 km, and 100 km resolution) to 3 km. For each downscaling factor, we consider three CNN configurations that generate super-resolved predictions of fine-scale wind speed, which take between 1 to 3 input fields: coarse wind speed, fine-scale topography, and diurnal cycle. In addition to fine-scale wind speeds, probability density function parameters are generated, through which sample wind speeds can be generated accounting for the intrinsic stochasticity of wind speed. For generalizability assessment, CNN models are tested on regions with different topography and climate that are unseen during training. The evaluation of super-resolved predictions focuses on subgrid-scale variability and the recovery of extremes. Models with coarse wind and fine topography as inputs exhibit the best performance compared with other model configurations, operating across the same downscaling factor. Our diurnal cycle encoding results in lower out-of-sample generalizability compared with other input configurations.
翻译:机器学习模型已被用于对气候数据进行无物理约束的纯数据驱动或混合动力降尺度分析。由于从粗分辨率数据中恢复精细尺度信息的挑战,大多数此类实现仅在相对较小的降尺度因子下运行。这限制了它们与许多全球气候模型输出的兼容性——这些输出分辨率通常在$\sim$50–100公里之间,难以直接应用于云解析或城市尺度等目标尺度。本研究系统考察了卷积神经网络将内陆地表风速数据从不同粗分辨率(25公里、48公里和100公里)降尺度至3公里的能力。针对每个降尺度因子,我们考虑了三种CNN配置,通过1至3个输入场(粗分辨率风速、精细尺度地形和日循环)生成超分辨率精细风速预测。除精细风速外,还生成了概率密度函数参数,可通过这些参数生成考虑风速内在随机性的样本风速。为评估泛化能力,CNN模型在训练过程中未见过的不同地形和气候区域进行测试。超分辨率预测的评估聚焦于亚网格尺度变异性及极端值的恢复。在相同降尺度因子下,以粗分辨率风速和精细地形为输入的模型相较于其他配置表现出最优性能。与其他输入配置相比,日循环编码导致样本外泛化能力降低。