A Goodness-of-Fit Test for Independent Component Models in High Dimensions

Independent component (IC) models are a standard tool for representing multivariate data in statistics, signal processing, and machine learning. Despite the extensive use of IC models, much less attention has been given to goodness-of-fit tests for assessing their compatibility with data. We develop the first goodness-of-fit test for IC models that is supported by a theoretical validity guarantee when the data dimension and sample size diverge proportionally. This is made possible by the fact that the test does not rely on a pre-whitening step, which often limits the applicability of other goodness-of-fit tests in high dimensions. Our theoretical analysis is complemented with numerical experiments that demonstrate the test's size control and power under a range of conditions. In addition, we provide examples involving gene-expression data to illustrate that the test has potential for effective diagnostic use in practice.

