Should we test the model assumptions before running a model-based test? - 专知论文

会员服务 ·

0

基于模型的测试 · 统计学 · 组合测试 · 统计方法 ·

2023 年 4 月 17 日

Should we test the model assumptions before running a model-based test?

翻译：我们是否应在运行基于模型的检验前检验模型假设？

M. Iqbal Shamsudheen,Christian Hennig

from arxiv, 35 pages, 1 figure

Statistical methods are based on model assumptions, and it is statistical folklore that a method's model assumptions should be checked before applying it. This can be formally done by running one or more misspecification tests of model assumptions before running a method that requires these assumptions; here we focus on model-based tests. A combined test procedure can be defined by specifying a protocol in which first model assumptions are tested and then, conditionally on the outcome, a test is run that requires or does not require the tested assumptions. Although such an approach is often taken in practice, much of the literature that investigated this is surprisingly critical of it. Our aim is to explore conditions under which model checking is advisable or not advisable. For this, we review results regarding such "combined procedures" in the literature, we review and discuss controversial views on the role of model checking in statistics, and we present a general setup in which we can show that preliminary model checking is advantageous, which implies conditions for making model checking worthwhile.

翻译：统计方法基于模型假设，而统计学的常规认知是在应用方法前应检验其模型假设。这一过程可通过在运行需要某些假设的方法之前，先执行一个或多个模型假设的设定错误检验来正式实现；本文聚焦于基于模型的检验。通过规定一个协议，即先检验模型假设，再根据检验结果有条件地运行需要或不需要这些假设的检验，可以定义一种组合检验程序。尽管这种方法在实践中常被采用，但许多相关文献却对其持出人意料的批评态度。本文旨在探究模型检验可取或不可取的条件。为此，我们回顾了文献中关于此类“组合程序”的研究结果，梳理并讨论了统计学中关于模型检验角色的争议性观点，并提出了一个通用框架，在该框架下可证明初步的模型检验具有优势，从而明确使模型检验有价值所需的条件。

0

相关内容

基于模型的测试

基于模型的测试

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

分布外泛化(Out-Of-Distribution Generalization) 综述论文，22页pdf240篇文献

专知会员服务

64+阅读 · 2021年9月2日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

43+阅读 · 2020年7月27日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

专知会员服务

35+阅读 · 2019年12月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

1+阅读 · 2022年6月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

基于LAMOST数据Mg超丰恒星的搜寻及研究

国家自然科学基金

0+阅读 · 2015年12月31日

CITED2在心脏干细胞衰老中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

组蛋白去乙酰化酶抑制剂对骨关节炎中Notch-NFAT信号通路调控的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

HDAC6介导的乙酰化表观遗传修饰在PCOS胰岛素抵抗中的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

IKIP在直肠癌术前放疗敏感性预测中的作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于模型的测试用例优化生成与自动执行

国家自然科学基金

0+阅读 · 2011年12月31日

以EGFR为识别靶位多靶点联合克服NSCLC EGFR TKIs耐药的基因干预研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于假设/保证自动推理的组合验证研究

国家自然科学基金

0+阅读 · 2009年12月31日

肾脏间充质干细胞在肾细胞癌血管生成多样性中的作用及机制

国家自然科学基金

0+阅读 · 2009年12月31日

A General Framework for Regression with Mismatched Data Based on Mixture Modeling

Arxiv

0+阅读 · 2023年6月1日

Image generation with shortest path diffusion

Arxiv

0+阅读 · 2023年6月1日

Time and Space Optimal Massively Parallel Algorithm for the 2-Ruling Set Problem

Arxiv

0+阅读 · 2023年6月1日

Relaxing the Additivity Constraints in Decentralized No-Regret High-Dimensional Bayesian Optimization

Arxiv

0+阅读 · 2023年5月31日

Solving Projected Model Counting by Utilizing Treewidth and its Limits

Arxiv

0+阅读 · 2023年5月30日

Bilevel Optimization with a Lower-level Contraction: Optimal Sample Complexity without Warm-Start

Arxiv

0+阅读 · 2023年5月30日

Are Diffusion Models Vulnerable to Membership Inference Attacks?

Arxiv

0+阅读 · 2023年5月30日

How to Staff When Customers Arrive in Batches

Arxiv

0+阅读 · 2023年5月29日

Membership Inference Attacks against Language Models via Neighbourhood Comparison

Arxiv

0+阅读 · 2023年5月29日

Optimizing Airbnb Search Journey with Multi-task Learning

Arxiv

0+阅读 · 2023年5月28日

VIP会员

文章信息

相关主题

基于模型的测试

最新内容

《在指挥链中通过多准则决策分析传达指挥官意图：空战实验》

《在指挥链中通过多准则决策分析传达指挥官意图：空战实验》

专知会员服务

9+阅读 · 今天10:44

消耗优势：美军的“精确规模化”概念

消耗优势：美军的“精确规模化”概念

专知会员服务

5+阅读 · 今天10:12

五角大楼的AI优先战略及其对现代战争的启示：来自与伊朗冲突的经验教训

五角大楼的AI优先战略及其对现代战争的启示：来自与伊朗冲突的经验教训

专知会员服务

6+阅读 · 今天9:56

《网络空间兵棋推演：挑战、局限性与混合路径》报告

《网络空间兵棋推演：挑战、局限性与混合路径》报告

专知会员服务

5+阅读 · 今天10:09

《离线语言支持系统：面向空战战术决策》

《离线语言支持系统：面向空战战术决策》

专知会员服务

5+阅读 · 今天9:53

《以通信为中心的6G–LLM架构：面向可扩展的战术自主防御车辆网络》

《以通信为中心的6G–LLM架构：面向可扩展的战术自主防御车辆网络》

专知会员服务

4+阅读 · 今天9:51

ICML 2026｜ECA：面向开放式图文生成的高效持续对齐

ICML 2026｜ECA：面向开放式图文生成的高效持续对齐

专知会员服务

4+阅读 · 6月14日

可信智能体AI综述：安全、鲁棒性、隐私与系统安全

可信智能体AI综述：安全、鲁棒性、隐私与系统安全

专知会员服务

4+阅读 · 6月14日

俄乌战场地面机器人如何改写战争规则

俄乌战场地面机器人如何改写战争规则

专知会员服务

9+阅读 · 6月14日

美国海军研究生院第23届年度采购研究研讨会与创新峰会：主题“加速作战能力”，附会议报告论文集1300页

美国海军研究生院第23届年度采购研究研讨会与创新峰会：主题“加速作战能力”，附会议报告论文集1300页

专知会员服务

9+阅读 · 6月14日

《新空中力量概念：来自敏捷战斗运用的启示》2026最新50页报告

《新空中力量概念：来自敏捷战斗运用的启示》2026最新50页报告

专知会员服务

12+阅读 · 6月14日

《无人水面艇文献综述与结构设计》135页

《无人水面艇文献综述与结构设计》135页

专知会员服务

13+阅读 · 6月13日

《自主蜂群系统的战略架构：多域一体化、抗毁韧性及海上作战框架（2025—2035）》46页报告

《自主蜂群系统的战略架构：多域一体化、抗毁韧性及海上作战框架（2025—2035）》46页报告

专知会员服务

11+阅读 · 6月13日

ICML 2026｜MEMOPILOT：用强化学习训练会进化的智能体记忆

ICML 2026｜MEMOPILOT：用强化学习训练会进化的智能体记忆

专知会员服务

2+阅读 · 6月13日

智能体时间序列系统全景综述：架构、可靠性与研究前沿

智能体时间序列系统全景综述：架构、可靠性与研究前沿

专知会员服务

11+阅读 · 6月13日

相关VIP内容

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

分布外泛化(Out-Of-Distribution Generalization) 综述论文，22页pdf240篇文献

专知会员服务

64+阅读 · 2021年9月2日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

43+阅读 · 2020年7月27日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

专知会员服务

35+阅读 · 2019年12月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

消耗优势：美军的“精确规模化”概念

《网络空间兵棋推演：挑战、局限性与混合路径》报告

《在指挥链中通过多准则决策分析传达指挥官意图：空战实验》

五角大楼的AI优先战略及其对现代战争的启示：来自与伊朗冲突的经验教训

相关资讯

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

1+阅读 · 2022年6月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

相关论文

A General Framework for Regression with Mismatched Data Based on Mixture Modeling

Arxiv

0+阅读 · 2023年6月1日

Image generation with shortest path diffusion

Arxiv

0+阅读 · 2023年6月1日

Time and Space Optimal Massively Parallel Algorithm for the 2-Ruling Set Problem

Arxiv

0+阅读 · 2023年6月1日

Relaxing the Additivity Constraints in Decentralized No-Regret High-Dimensional Bayesian Optimization

Arxiv

0+阅读 · 2023年5月31日

Solving Projected Model Counting by Utilizing Treewidth and its Limits

Arxiv

0+阅读 · 2023年5月30日

Bilevel Optimization with a Lower-level Contraction: Optimal Sample Complexity without Warm-Start

Arxiv

0+阅读 · 2023年5月30日

Are Diffusion Models Vulnerable to Membership Inference Attacks?

Arxiv

0+阅读 · 2023年5月30日

How to Staff When Customers Arrive in Batches

Arxiv

0+阅读 · 2023年5月29日

Membership Inference Attacks against Language Models via Neighbourhood Comparison

Arxiv

0+阅读 · 2023年5月29日

Optimizing Airbnb Search Journey with Multi-task Learning

Arxiv

0+阅读 · 2023年5月28日

相关基金

基于LAMOST数据Mg超丰恒星的搜寻及研究

国家自然科学基金

0+阅读 · 2015年12月31日

CITED2在心脏干细胞衰老中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

组蛋白去乙酰化酶抑制剂对骨关节炎中Notch-NFAT信号通路调控的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

HDAC6介导的乙酰化表观遗传修饰在PCOS胰岛素抵抗中的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

IKIP在直肠癌术前放疗敏感性预测中的作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于模型的测试用例优化生成与自动执行

国家自然科学基金

0+阅读 · 2011年12月31日

以EGFR为识别靶位多靶点联合克服NSCLC EGFR TKIs耐药的基因干预研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于假设/保证自动推理的组合验证研究

国家自然科学基金

0+阅读 · 2009年12月31日

肾脏间充质干细胞在肾细胞癌血管生成多样性中的作用及机制

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员