On optimality of Mallows model averaging

In the past decades, model averaging (MA) has attracted much attention as it has emerged as an alternative tool to the model selection (MS) statistical approach. Hansen [\emph{Econometrica} \textbf{75} (2007) 1175--1189] introduced a Mallows model averaging (MMA) method with model weights selected by minimizing a Mallows' $C_p$ criterion. The main theoretical justification for MMA is an asymptotic optimality (AOP), which states that the risk/loss of the resulting MA estimator is asymptotically equivalent to that of the best but infeasible averaged model. MMA's AOP is proved in the literature by either constraining weights in a special discrete weight set or limiting the number of candidate models. In this work, it is first shown that under these restrictions, however, the optimal risk of MA becomes an unreachable target, and MMA may converge more slowly than MS. In this background, a foundational issue that has not been addressed is: When a suitably large set of candidate models is considered, and the model weights are not harmfully constrained, can the MMA estimator perform asymptotically as well as the optimal convex combination of the candidate models? We answer this question in a nested model setting commonly adopted in the area of MA. We provide finite sample inequalities for the risk of MMA and show that without unnatural restrictions on the candidate models, MMA's AOP holds in a general continuous weight set under certain mild conditions. Several specific methods for constructing the candidate model sets are proposed. Implications on minimax adaptivity are given as well. The results from simulations back up our theoretical findings.

翻译：过去几十年来，模型平均（MA）作为模型选择（MS）统计方法的替代工具引起了广泛关注。Hansen [《计量经济学》**75** (2007) 1175–1189] 提出了一种Mallows模型平均（MMA）方法，通过最小化Mallows的$C_p$准则来选择模型权重。MMA的主要理论依据是渐近最优性（AOP），即所得到的MA估计量的风险/损失在渐近意义上等价于最优但不可实现平均模型的风险/损失。现有文献中，MMA的AOP要么通过将权重约束在特殊离散权重集中得到证明，要么通过限制候选模型数量来证明。然而，本文首先证明，在这些限制条件下，MA的最优风险成为不可达到的目标，且MMA的收敛速度可能慢于MS。在此背景下，一个尚未解决的基础性问题是：当考虑适当广泛的候选模型集且模型权重未受有害约束时，MMA估计量能否渐近地达到候选模型最优凸组合的表现？我们在模型平均领域常用的嵌套模型设定下回答了这一问题。我们给出了MMA风险的有穷样本不等式，并证明在不对候选模型施加非自然限制的条件下，MMA的AOP在一般连续权重集中得以在特定温和条件下成立。提出了若干构建候选模型集的具体方法，并讨论了其对极小极大适应性的含义。模拟结果进一步支持了我们的理论发现。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日