Bootstrapping multiple systems estimates to account for model selection - 专知论文

会员服务 ·

0

模型选择 · 模型估计 · 多模型 · 准则 · 多模 ·

2023 年 3 月 31 日

Bootstrapping multiple systems estimates to account for model selection

翻译：多系统估计中的自助法以考虑模型选择

Bernard W. Silverman,Kyle Vincent,Lax Chan

from arxiv, 21 pages, 5 figures, 6 tables

Multiple systems estimation is a standard approach to quantifying hidden populations where data sources are based on lists of known cases. A typical modelling approach is to fit a Poisson loglinear model to the numbers of cases observed in each possible combination of the lists. It is necessary to decide which interaction parameters to include in the model, and information criterion approaches are often used for model selection. Difficulties in the context of multiple systems estimation may arise due to sparse or nil counts based on the intersection of lists, and care must be taken when information criterion approaches are used for model selection due to issues relating to the existence of estimates and identifiability of the model. Confidence intervals are often reported conditional on the model selected, providing an over-optimistic impression of the accuracy of the estimation. A bootstrap approach is a natural way to account for the model selection procedure. However, because the model selection step has to be carried out for every bootstrap replication, there may be a high or even prohibitive computational burden. We explore the merit of modifying the model selection procedure in the bootstrap to look only among a subset of models, chosen on the basis of their information criterion score on the original data. This provides large computational gains with little apparent effect on inference. Another model selection approach considered and investigated is a downhill search approach among models, possibly with multiple starting points.

翻译：多系统估计是一种量化隐藏人群的标准方法，其中数据来源基于已知案例的名单。典型的建模方法是对每个可能名单组合中观察到的案例数量拟合泊松对数线性模型。需要决定模型中包含哪些交互参数，信息准则方法常被用于模型选择。在多系统估计的背景下，由于基于名单交集的稀疏或零计数，可能出现困难；并且由于估计存在性和模型可辨识性问题，使用信息准则方法进行模型选择时必须谨慎。通常报告的条件于所选模型的置信区间会给人过于乐观的估计精度印象。自助法是一种自然的方式来考虑模型选择过程。然而，由于每个自助复制都必须执行模型选择步骤，可能会产生很高甚至无法承受的计算负担。我们探讨了在自助法中修改模型选择程序的价值，即仅基于原始数据的信息准则评分，在模型子集中进行选择。这带来了巨大的计算收益，且对推断几乎没有明显影响。另一种被考虑和研究的模型选择方法是模型中的下坡搜索方法，可能采用多个起始点。

0

相关内容

模型选择

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

【简明书】数学，统计和机器学习的动手入门，57页pdf，A Hands-On Introduction to Math, Stats, and Machine Learning

【简明书】数学，统计和机器学习的动手入门，57页pdf，A Hands-On Introduction to Math, Stats, and Machine Learning

专知会员服务

43+阅读 · 2022年2月26日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

【论文推荐】文本摘要简述

【论文推荐】文本摘要简述

专知会员服务

69+阅读 · 2020年7月20日

对话推荐系统综述论文，35页pdf，A Survey on Conversational Recommender Systems

对话推荐系统综述论文，35页pdf，A Survey on Conversational Recommender Systems

专知会员服务

117+阅读 · 2020年4月3日

【论文推荐WWW2020-UIUC】修正排序系统中的选择偏差：Correcting for Selection Bias in Learning-to-rank Systems

【论文推荐WWW2020-UIUC】修正排序系统中的选择偏差：Correcting for Selection Bias in Learning-to-rank Systems

专知会员服务

32+阅读 · 2020年2月1日

【独立研究者I-Sheng Yang论文】因果机器学习损失函数（A Loss-Function for Causal Machine-Learning）

【独立研究者I-Sheng Yang论文】因果机器学习损失函数（A Loss-Function for Causal Machine-Learning）

专知会员服务

20+阅读 · 2020年1月7日

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

专知会员服务

15+阅读 · 2019年11月15日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

不再让CPU和总线拖后腿：Exafunction让GPU跑的更快！

不再让CPU和总线拖后腿：Exafunction让GPU跑的更快！

机器之心

0+阅读 · 2022年10月7日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

缺失数据统计分析，第三版，462页pdf

缺失数据统计分析，第三版，462页pdf

专知

50+阅读 · 2020年2月28日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

方差正则化的分类模型选择方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于复合分位数回归和最大秩相关想法的ROC回归曲线估计

国家自然科学基金

0+阅读 · 2013年12月31日

超高维半参数回归模型的结构识别和变量选择问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

复杂制造过程中轮廓数据监控方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

连续时间马氏决策过程均值-方差优化问题的研究

国家自然科学基金

0+阅读 · 2012年12月31日

含有缺失值的纵向数据回归模型的稳健推断

国家自然科学基金

3+阅读 · 2012年12月31日

半参数回归分析的随机函数法及其高维情形

国家自然科学基金

2+阅读 · 2012年12月31日

基于参数和半参数回归模型的小区域估计问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

用多重假设检验方法来研究方差变点问题

国家自然科学基金

0+阅读 · 2009年12月31日

区间删失数据下竞争风险模型研究

国家自然科学基金

0+阅读 · 2008年12月31日

Lightweight Online Learning for Sets of Related Problems in Automated Reasoning

Arxiv

0+阅读 · 2023年5月22日

Approximating a RUM from Distributions on k-Slates

Arxiv

0+阅读 · 2023年5月22日

Quantifying the effect of X-ray scattering for data generation in real-time defect detection

Arxiv

0+阅读 · 2023年5月22日

A parametric distribution for exact post-selection inference with data carving

Arxiv

0+阅读 · 2023年5月21日

Precise Unbiased Estimation in Randomized Experiments using Auxiliary Observational Data

Arxiv

0+阅读 · 2023年5月19日

Bayesian inference for misspecified generative models

Arxiv

0+阅读 · 2023年5月19日

Towards Intersectional Moderation: An Alternative Model of Moderation Built on Care and Power

Arxiv

0+阅读 · 2023年5月18日

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

Arxiv

0+阅读 · 2023年5月18日

Spectral Change Point Estimation for High Dimensional Time Series by Sparse Tensor Decomposition

Arxiv

0+阅读 · 2023年5月18日

Using Perturbation to Improve Goodness-of-Fit Tests based on Kernelized Stein Discrepancy

Arxiv

0+阅读 · 2023年5月17日

VIP会员

文章信息

相关主题

最新内容

《面向指挥控制训练与实时北约兼容数据分发的战术模拟器》

《面向指挥控制训练与实时北约兼容数据分发的战术模拟器》

专知会员服务

1+阅读 · 今天5:21

《决策模型比较研究》

《决策模型比较研究》

专知会员服务

5+阅读 · 今天5:16

全球军事与武器工业中的人工智能：应用、方法与影响（万字长文）

全球军事与武器工业中的人工智能：应用、方法与影响（万字长文）

专知会员服务

2+阅读 · 今天4:37

《美军水下战与海床战概述及本地实施》

《美军水下战与海床战概述及本地实施》

专知会员服务

2+阅读 · 今天4:30

面向未来冲突推进陆军情报体制改革

面向未来冲突推进陆军情报体制改革

专知会员服务

2+阅读 · 今天4:12

人工智能赋能无人机：俄乌冲突案例及其深远影响（万字长文）

人工智能赋能无人机：俄乌冲突案例及其深远影响（万字长文）

专知会员服务

3+阅读 · 今天2:54

《反无人机蜂群：有人-无人协同防御场景下的编队重构分析》

《反无人机蜂群：有人-无人协同防御场景下的编队重构分析》

专知会员服务

7+阅读 · 7月24日

《史诗怒火/咆哮雄狮行动：针对伊朗空中战役的战略分析》68页智库报告

《史诗怒火/咆哮雄狮行动：针对伊朗空中战役的战略分析》68页智库报告

专知会员服务

6+阅读 · 7月24日

“愈演愈烈的欺骗与干扰博弈”：无人机与人工智能背景下俄乌强化以无人机为核心的电子战

“愈演愈烈的欺骗与干扰博弈”：无人机与人工智能背景下俄乌强化以无人机为核心的电子战

专知会员服务

4+阅读 · 7月24日

乌克兰纵深打击如何重塑俄罗斯的战略选择

乌克兰纵深打击如何重塑俄罗斯的战略选择

专知会员服务

2+阅读 · 7月24日

《分布式太空任务对比分析与综合建模及仿真环境》120页

《分布式太空任务对比分析与综合建模及仿真环境》120页

专知会员服务

2+阅读 · 7月24日

俄乌战争中关于中程打击无人机部署的经验启示

俄乌战争中关于中程打击无人机部署的经验启示

专知会员服务

3+阅读 · 7月24日

《远程自主系统可扩展态势感知的解决方案》32页2026最新报告

《远程自主系统可扩展态势感知的解决方案》32页2026最新报告

专知会员服务

5+阅读 · 7月23日

《基于强化学习的自动化红队测试》

《基于强化学习的自动化红队测试》

专知会员服务

5+阅读 · 7月23日

《下一代无人机-卫星通信：人工智能创新与未来展望》32页长综述

《下一代无人机-卫星通信：人工智能创新与未来展望》32页长综述

专知会员服务

8+阅读 · 7月23日

相关VIP内容

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

【简明书】数学，统计和机器学习的动手入门，57页pdf，A Hands-On Introduction to Math, Stats, and Machine Learning

【简明书】数学，统计和机器学习的动手入门，57页pdf，A Hands-On Introduction to Math, Stats, and Machine Learning

专知会员服务

43+阅读 · 2022年2月26日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

【论文推荐】文本摘要简述

【论文推荐】文本摘要简述

专知会员服务

69+阅读 · 2020年7月20日

对话推荐系统综述论文，35页pdf，A Survey on Conversational Recommender Systems

对话推荐系统综述论文，35页pdf，A Survey on Conversational Recommender Systems

专知会员服务

117+阅读 · 2020年4月3日

【论文推荐WWW2020-UIUC】修正排序系统中的选择偏差：Correcting for Selection Bias in Learning-to-rank Systems

【论文推荐WWW2020-UIUC】修正排序系统中的选择偏差：Correcting for Selection Bias in Learning-to-rank Systems

专知会员服务

32+阅读 · 2020年2月1日

【独立研究者I-Sheng Yang论文】因果机器学习损失函数（A Loss-Function for Causal Machine-Learning）

【独立研究者I-Sheng Yang论文】因果机器学习损失函数（A Loss-Function for Causal Machine-Learning）

专知会员服务

20+阅读 · 2020年1月7日

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

专知会员服务

15+阅读 · 2019年11月15日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《决策模型比较研究》

《美军水下战与海床战概述及本地实施》

《面向指挥控制训练与实时北约兼容数据分发的战术模拟器》

全球军事与武器工业中的人工智能：应用、方法与影响（万字长文）

相关资讯

不再让CPU和总线拖后腿：Exafunction让GPU跑的更快！

不再让CPU和总线拖后腿：Exafunction让GPU跑的更快！

机器之心

0+阅读 · 2022年10月7日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

缺失数据统计分析，第三版，462页pdf

缺失数据统计分析，第三版，462页pdf

专知

50+阅读 · 2020年2月28日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Lightweight Online Learning for Sets of Related Problems in Automated Reasoning

Arxiv

0+阅读 · 2023年5月22日

Approximating a RUM from Distributions on k-Slates

Arxiv

0+阅读 · 2023年5月22日

Quantifying the effect of X-ray scattering for data generation in real-time defect detection

Arxiv

0+阅读 · 2023年5月22日

A parametric distribution for exact post-selection inference with data carving

Arxiv

0+阅读 · 2023年5月21日

Precise Unbiased Estimation in Randomized Experiments using Auxiliary Observational Data

Arxiv

0+阅读 · 2023年5月19日

Bayesian inference for misspecified generative models

Arxiv

0+阅读 · 2023年5月19日

Towards Intersectional Moderation: An Alternative Model of Moderation Built on Care and Power

Arxiv

0+阅读 · 2023年5月18日

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

Arxiv

0+阅读 · 2023年5月18日

Spectral Change Point Estimation for High Dimensional Time Series by Sparse Tensor Decomposition

Arxiv

0+阅读 · 2023年5月18日

Using Perturbation to Improve Goodness-of-Fit Tests based on Kernelized Stein Discrepancy

Arxiv

0+阅读 · 2023年5月17日

相关基金

方差正则化的分类模型选择方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于复合分位数回归和最大秩相关想法的ROC回归曲线估计

国家自然科学基金

0+阅读 · 2013年12月31日

超高维半参数回归模型的结构识别和变量选择问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

复杂制造过程中轮廓数据监控方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

连续时间马氏决策过程均值-方差优化问题的研究

国家自然科学基金

0+阅读 · 2012年12月31日

含有缺失值的纵向数据回归模型的稳健推断

国家自然科学基金

3+阅读 · 2012年12月31日

半参数回归分析的随机函数法及其高维情形

国家自然科学基金

2+阅读 · 2012年12月31日

基于参数和半参数回归模型的小区域估计问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

用多重假设检验方法来研究方差变点问题

国家自然科学基金

0+阅读 · 2009年12月31日

区间删失数据下竞争风险模型研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员