过参数化模型的插值信息准则 (The Interpolating Information Criterion for Overparameterized Models) - 专知论文

会员服务 ·

0

参数化 · 准则 · 参数化模型 · 模型选择 · 相同 ·

The Interpolating Information Criterion for Overparameterized Models

翻译：过参数化模型的插值信息准则

Liam Hodgkinson,Chris van der Heide,Robert Salomone,Fred Roosta,Michael W. Mahoney

from arxiv, 23 pages, 2 figures

The problem of model selection is considered for the setting of interpolating estimators, where the number of model parameters exceeds the size of the dataset. Classical information criteria typically consider the large-data limit, penalizing model size. However, these criteria are not appropriate in modern settings where overparameterized models tend to perform well. For any overparameterized model, we show that there exists a dual underparameterized model that possesses the same marginal likelihood, thus establishing a form of Bayesian duality. This enables more classical methods to be used in the overparameterized setting, revealing the Interpolating Information Criterion, a measure of model quality that naturally incorporates the choice of prior into the model selection. Our new information criterion accounts for prior misspecification, geometric and spectral properties of the model, and is numerically consistent with known empirical and theoretical behavior in this regime.

翻译：本文研究了插值估计器背景下的模型选择问题，其中模型参数数量超过数据集规模。经典信息准则通常考虑大数据极限，对模型规模施加惩罚。然而，这些准则不适用于过参数化模型往往表现优异的现代场景。对于任意过参数化模型，我们证明存在一个对偶的欠参数化模型具有相同的边缘似然，从而建立了一种贝叶斯对偶形式。这使得更经典的方法能够应用于过参数化场景，并揭示了插值信息准则——一种自然将先验选择纳入模型选择的模型质量度量标准。我们提出的新信息准则考虑了先验设定错误、模型的几何与谱特性，其数值结果与该领域已知的经验和理论行为保持一致。

0

相关内容

参数化

【罗切斯特Yuqian Zhang等书】从对称到几何:可处理的非凸问题，34页pdf，From Symmetry to Geometry: Tractable Nonconvex Problems

【罗切斯特Yuqian Zhang等书】从对称到几何:可处理的非凸问题，34页pdf，From Symmetry to Geometry: Tractable Nonconvex Problems

专知会员服务

20+阅读 · 2022年3月4日

【CMU-Yuejie Chi等干货书】满足低秩矩阵分解的非凸优化综述，69页pdf，Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

【CMU-Yuejie Chi等干货书】满足低秩矩阵分解的非凸优化综述，69页pdf，Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

专知会员服务

33+阅读 · 2022年3月4日

【ICML2021】基于低秩重参数化的大规模私有学习

专知会员服务

12+阅读 · 2021年6月20日

【NeurIPS2020】无限可能的联合对比学习

专知会员服务

29+阅读 · 2020年10月2日

知识图谱嵌入模型的概率标定,Probability Calibration for Knowledge Graph Embedding Models

专知会员服务

36+阅读 · 2020年5月11日

【CVPR2021】跨模态检索的概率嵌入

【CVPR2021】跨模态检索的概率嵌入

专知

17+阅读 · 2021年3月2日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

误差反向传播——CNN

误差反向传播——CNN

统计学习与视觉计算组

30+阅读 · 2018年7月12日

语义分割中的深度学习方法全解：从FCN、SegNet到DeepLab

语义分割中的深度学习方法全解：从FCN、SegNet到DeepLab

炼数成金订阅号

26+阅读 · 2017年7月10日

MNIST入门：贝叶斯方法

MNIST入门：贝叶斯方法

Python程序员

23+阅读 · 2017年7月3日

测量误差数据下部分线性模型有约束统计推断理论

国家自然科学基金

2+阅读 · 2015年12月31日

Jacobi行列式和Hilbert变换中的若干问题及应用

国家自然科学基金

0+阅读 · 2014年12月31日

高维数据下的模型平均方法

国家自然科学基金

6+阅读 · 2014年12月31日

一般误差分布下若干半参数模型的复合分位数方法

国家自然科学基金

0+阅读 · 2014年12月31日

变换结构方程模型的非参数贝叶斯分析

国家自然科学基金

4+阅读 · 2014年12月31日

Estimators for Substitution Rates in Genomes from Read Data

Arxiv

0+阅读 · 1月12日

A Kernelization-Based Approach to Nonparametric Binary Choice Models

Arxiv

0+阅读 · 1月11日

An Empirical Investigation of Robustness in Large Language Models under Tabular Distortions

Arxiv

0+阅读 · 1月8日

Exponentially Consistent Low Complexity Tests for Outlier Hypothesis Testing

Arxiv

0+阅读 · 1月8日

Measuring Uncertainty Calibration

Arxiv

0+阅读 · 1月7日

VIP会员

文章信息

相关主题

参数化模型

相关VIP内容

【罗切斯特Yuqian Zhang等书】从对称到几何:可处理的非凸问题，34页pdf，From Symmetry to Geometry: Tractable Nonconvex Problems

【罗切斯特Yuqian Zhang等书】从对称到几何:可处理的非凸问题，34页pdf，From Symmetry to Geometry: Tractable Nonconvex Problems

专知会员服务

20+阅读 · 2022年3月4日

【CMU-Yuejie Chi等干货书】满足低秩矩阵分解的非凸优化综述，69页pdf，Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

【CMU-Yuejie Chi等干货书】满足低秩矩阵分解的非凸优化综述，69页pdf，Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

专知会员服务

33+阅读 · 2022年3月4日

【ICML2021】基于低秩重参数化的大规模私有学习

专知会员服务

12+阅读 · 2021年6月20日

【NeurIPS2020】无限可能的联合对比学习

专知会员服务

29+阅读 · 2020年10月2日

知识图谱嵌入模型的概率标定,Probability Calibration for Knowledge Graph Embedding Models

专知会员服务

36+阅读 · 2020年5月11日

热门VIP内容

开通专知VIP会员享更多权益服务

机器人领域的多任务泛化研究

美国战争部人工智能加速战略

《科研智能发展报告（2025年）》发布

法律领域中的大语言模型智能体：分类体系、应用场景与挑战

相关资讯

【CVPR2021】跨模态检索的概率嵌入

【CVPR2021】跨模态检索的概率嵌入

专知

17+阅读 · 2021年3月2日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

误差反向传播——CNN

误差反向传播——CNN

统计学习与视觉计算组

30+阅读 · 2018年7月12日

语义分割中的深度学习方法全解：从FCN、SegNet到DeepLab

语义分割中的深度学习方法全解：从FCN、SegNet到DeepLab

炼数成金订阅号

26+阅读 · 2017年7月10日

MNIST入门：贝叶斯方法

MNIST入门：贝叶斯方法

Python程序员

23+阅读 · 2017年7月3日

相关论文

Estimators for Substitution Rates in Genomes from Read Data

Arxiv

0+阅读 · 1月12日

A Kernelization-Based Approach to Nonparametric Binary Choice Models

Arxiv

0+阅读 · 1月11日

An Empirical Investigation of Robustness in Large Language Models under Tabular Distortions

Arxiv

0+阅读 · 1月8日

Exponentially Consistent Low Complexity Tests for Outlier Hypothesis Testing

Arxiv

0+阅读 · 1月8日

Measuring Uncertainty Calibration

Arxiv

0+阅读 · 1月7日

相关基金

测量误差数据下部分线性模型有约束统计推断理论

国家自然科学基金

2+阅读 · 2015年12月31日

Jacobi行列式和Hilbert变换中的若干问题及应用

国家自然科学基金

0+阅读 · 2014年12月31日

高维数据下的模型平均方法

国家自然科学基金

6+阅读 · 2014年12月31日

一般误差分布下若干半参数模型的复合分位数方法

国家自然科学基金

0+阅读 · 2014年12月31日

变换结构方程模型的非参数贝叶斯分析

国家自然科学基金

4+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员