Ensemble transform Kalman filtering (ETKF) data assimilation is often used to combine available observations with numerical simulations to obtain statistically accurate and reliable state representations in dynamical systems. However, it is well known that the commonly used Gaussian distribution assumption introduces biases for state variables that admit discontinuous profiles, which are prevalent in nonlinear partial differential equations. This investigation designs a new structurally informed non-Gaussian prior that exploits statistical information from the simulated state variables. In particular, we construct a new weighting matrix based on the second moment of the gradient information of the state variable to replace the prior covariance matrix used for model/data compromise in the ETKF data assimilation framework. We further adapt our weighting matrix to include information in discontinuity regions via a clustering technique. Our numerical experiments demonstrate that this new approach yields more accurate estimates than those obtained using ETKF on shallow water equations, even when ETKF is enhanced with inflation and localization techniques.
翻译:集成变换卡尔曼滤波(ETKF)数据同化常被用于将观测数据与数值模拟相结合,以在动力系统中获得统计上准确可靠的状态表征。然而,众所周知,常用的高斯分布假设会为具有不连续剖面的状态变量引入偏差,这在非线性偏微分方程中普遍存在。本研究设计了一种新的基于结构的非高斯先验,利用模拟状态变量的统计信息。具体而言,我们基于状态变量梯度信息的二阶矩构建新的权重矩阵,以替代ETKF数据同化框架中用于模型/数据折中的先验协方差矩阵。进一步地,我们通过聚类技术自适应调整权重矩阵以包含不连续区域的信息。数值实验表明,即使ETKF采用膨胀和定位技术进行增强,本方法在浅水方程上仍能获得比ETKF更精确的估计。