We present a new additive method, nicknamed sage for Simplified Additive Gaussian processes Emulator, to emulate climate model Perturbed Parameter Ensembles (PPEs). It estimates the value of a climate model output as the sum of additive terms. Each additive term is the mean of a Gaussian Process, and corresponds to the impact of a parameter or parameter group on the variable of interest. This design caters to the sparsity of PPEs which are characterized by limited ensemble members and high dimensionality of the parameter space. sage quantifies the variability explained by different parameters and parameter groups, providing additional insights on the parameter-climate model output relationship. We apply the method to two climate model PPEs and compare it to a fully connected Neural Network. The two methods have comparable performance with both PPEs, but sage provides insights on parameter and parameter group importance as well as diagnostics useful for optimizing PPE design. Insights gained are valid regardless of the emulator method used, and have not been previously addressed. Our work highlights that analyzing the PPE used to train an emulator is different from analyzing data generated from an emulator trained on the PPE, as the former provides more insights on the data structure in the PPE which could help inform the emulator design.
翻译:我们提出了一种新的加法方法,简称为sage(简化加法高斯过程仿真器),用于仿真气候模型的扰动参数集合。该方法将气候模型输出的值估计为加法项之和。每个加法项是一个高斯过程的均值,对应于某个参数或参数组对目标变量的影响。这种设计针对扰动参数集合的稀疏性而优化,此类集合通常具有有限的集合成员和参数空间的高维特性。sage能够量化不同参数及参数组所解释的变异性,从而为参数与气候模型输出之间的关系提供额外见解。我们将该方法应用于两个气候模型扰动参数集合,并与全连接神经网络进行比较。两种方法在两个扰动参数集合上表现出相当的性能,但sage能够提供关于参数及参数组重要性的见解,以及可用于优化扰动参数集合设计的诊断信息。所获得的见解与所使用的仿真器方法无关,且此前尚未被探讨。我们的工作强调,分析用于训练仿真器的扰动参数集合与分析由基于该集合训练的仿真器生成的数据是不同的,因为前者能更深入地揭示扰动参数集合中的数据结构,这有助于指导仿真器的设计。