January 29, 2023

Likelihood-Free Frequentist Inference: Confidence Sets with Correct Conditional Coverage

Niccolò Dalmasso, Luca Masserano, David Zhao, Rafael Izbicki, Ann B. Lee

From arXiv; 59 pages, 14 figures. Code available at https://github.com/Mr8ND/ACORE-LFI

Many areas of science make extensive use of computer simulators that implicitly encode likelihood functions of complex systems. Classical statistical methods are poorly suited for these so-called likelihood-free inference (LFI) settings, particularly outside asymptotic and low-dimensional regimes. Although new machine learning methods, such as normalizing flows, have revolutionized the sample efficiency and capacity of LFI methods, it remains an open question whether they produce confidence sets with correct conditional coverage for small sample sizes. This paper unifies classical statistics with modern machine learning to present (i) a practical procedure for the Neyman construction of confidence sets with finite-sample guarantees of nominal coverage, and (ii) diagnostics that estimate conditional coverage over the entire parameter space. We refer to our framework as likelihood-free frequentist inference (LF2I). Any method that defines a test statistic, like the likelihood ratio, can leverage the LF2I machinery to create valid confidence sets and diagnostics without costly Monte Carlo samples at fixed parameter settings. We study the power of two test statistics (ACORE and BFF), which, respectively, maximize versus integrate an odds function over the parameter space. Our paper discusses the benefits and challenges of LF2I, with a breakdown of the sources of errors in LF2I confidence sets.
