Whether deterministic or stochastic, models can be viewed as functions designed to approximate a specific quantity of interest. We propose a data-driven framework that aggregates predictions from diverse models into a single, more accurate output. This aggregation approach exploits each model's strengths to enhance overall accuracy. It is non-intrusive (treating models as black-box functions), model-agnostic, requires minimal assumptions, and can combine outputs from a wide range of models, including machine learning models and numerical solvers. We argue that the aggregation process should be point-wise linear, and we propose two methods for finding an optimal aggregate: Minimal Error Aggregation (MEA), which minimizes the aggregate's prediction error, and Minimal Variance Aggregation (MVA), which minimizes its variance. While MEA is inherently more accurate when the correlations between the models and the target quantity are perfectly known, Minimal Empirical Variance Aggregation (MEVA), the empirical version of MVA, consistently outperforms Minimal Empirical Error Aggregation (MEEA), the empirical counterpart of MEA, when these correlations must be estimated from data. The key difference is that MEVA constructs the aggregate by estimating model errors, whereas MEEA treats the models as features for direct interpolation of the quantity of interest. This makes MEEA more susceptible to overfitting and poor generalization, and the aggregate may underperform individual models at test time. We demonstrate the versatility and effectiveness of our framework in applications ranging from data science to partial differential equations, showing how it successfully integrates traditional solvers with machine learning models to improve both robustness and accuracy.
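The point-wise linear, variance-minimizing aggregation described above can be illustrated with a minimal sketch. This is not the paper's exact MEVA procedure: it assumes unbiased models with uncorrelated errors, under which minimal-variance weights reduce to classic inverse-variance weighting, with each model's error variance estimated from held-out data. All model and variable names here are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
truth = np.sin(np.linspace(0.0, 3.0, 200))

# Three hypothetical black-box "models": the target corrupted by
# independent noise of different magnitudes.
noise_scales = np.array([0.1, 0.3, 0.5])
preds = truth + rng.normal(size=(3, 200)) * noise_scales[:, None]

# Estimate each model's error variance on held-out data (first half),
# then aggregate on the remaining points with inverse-variance weights.
train, test = slice(0, 100), slice(100, 200)
err_var = np.var(preds[:, train] - truth[train], axis=1)
w = (1.0 / err_var) / np.sum(1.0 / err_var)  # weights sum to 1
aggregate = w @ preds[:, test]

def mse(p):
    return float(np.mean((p - truth[test]) ** 2))

print("model MSEs:", [mse(p) for p in preds[:, test]])
print("aggregate MSE:", mse(aggregate))
```

Because the aggregate is a convex combination weighted toward the more accurate models, its error is typically below that of any individual model; the paper's empirical methods generalize this idea to point-wise weights learned from data.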