Interpretable structural model error discovery from sparse assimilation increments using spectral bias-reduced neural networks: A quasi-geostrophic turbulence test case

MoDELS · 稀疏 · CASE · Neural Networks · Networks ·

2023 年 9 月 22 日

翻译：基于谱偏差缩减神经网络从稀疏同化增量中发现可解释的结构性模型误差：准地转湍流测试案例

Rambod Mojgani,Ashesh Chattopadhyay,Pedram Hassanzadeh

from arxiv, 26 pages, 5+1 figures

Earth system models suffer from various structural and parametric errors in their representation of nonlinear, multi-scale processes, leading to uncertainties in their long-term projections. The effects of many of these errors (particularly those due to fast physics) can be quantified in short-term simulations, e.g., as differences between the predicted and observed states (analysis increments). With the increase in the availability of high-quality observations and simulations, learning nudging from these increments to correct model errors has become an active research area. However, most studies focus on using neural networks, which while powerful, are hard to interpret, are data-hungry, and poorly generalize out-of-distribution. Here, we show the capabilities of Model Error Discovery with Interpretability and Data Assimilation (MEDIDA), a general, data-efficient framework that uses sparsity-promoting equation-discovery techniques to learn model errors from analysis increments. Using two-layer quasi-geostrophic turbulence as the test case, MEDIDA is shown to successfully discover various linear and nonlinear structural/parametric errors when full observations are available. Discovery from spatially sparse observations is found to require highly accurate interpolation schemes. While NNs have shown success as interpolators in recent studies, here, they are found inadequate due to their inability to accurately represent small scales, a phenomenon known as spectral bias. We show that a general remedy, adding a random Fourier feature layer to the NN, resolves this issue enabling MEDIDA to successfully discover model errors from sparse observations. These promising results suggest that with further development, MEDIDA could be scaled up to models of the Earth system and real observations.

翻译：地球系统模型在描述非线性多尺度过程时存在各种结构和参数误差，导致其长期预测存在不确定性。许多误差（尤其是快速物理过程引起的误差）的影响可通过短期模拟量化，例如预测状态与观测状态之间的差异（分析增量）。随着高质量观测和模拟数据的增多，从这些增量中学习修正模型误差的“松弛逼近”方法已成为活跃研究领域。然而，现有研究多采用神经网络，虽功能强大但存在解释性差、数据需求大、域外泛化能力弱等问题。本文展示了兼具可解释性与数据同化的模型误差发现框架（MEDIDA）的能力。该通用且数据高效的框架利用稀疏性驱动的方程发现技术，从分析增量中学习模型误差。以两层准地转湍流为测试案例，MEDIDA在完全观测条件下成功发现了线性和非线性结构/参数误差。稀疏空间观测的误差发现需要高精度插值方案。尽管近期研究表明神经网络可作为有效的插值器，但本文发现其因无法准确表征小尺度现象（即谱偏差）而存在局限。我们证明，通过向神经网络的输入添加随机傅里叶特征层这一通用改进方法，可解决该问题，使MEDIDA能从稀疏观测中成功发现模型误差。这些突破性结果表明，通过进一步开发，MEDIDA有望扩展至地球系统模型及真实观测数据。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

《用于无线通信和传感的智能反射面 (IRS)》（ICC 2022）新加坡国立大学2022最新53页slides

专知会员服务

26+阅读 · 2022年11月16日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

分布外泛化(Out-Of-Distribution Generalization) 综述论文，22页pdf240篇文献

专知会员服务

64+阅读 · 2021年9月2日