Classifier-Based Nonparametric Sequential Hypothesis Testing

We consider the problem of constructing sequential power-one tests where the null and alternative classes are specified indirectly through historical or offline data. More specifically, given an offline dataset consisting of observations from $L+1$ distributions $\{P_0, P_1, \ldots, P_L\}$, and a new unlabeled data stream $\{X_t: t \geq 1\} \overset{i.i.d}{\sim} P_θ$, the goal is to decide between the null $H_0: θ= 0$, against the alternative $H_1: θ\in [L]:=\{1,\ldots,L\}$. Our main methodological contribution is a general approach for designing a level-$α$ power-one test for this problem using a multi-class classifier trained on the given offline dataset. Working under a mild "separability" condition on the distributions and the trained classifier, we obtain an upper bound on the expected stopping time of our proposed level-$α$ test, and then show that in general this cannot be improved. In addition to rejecting the null, we show that our procedure can also identify the true underlying distribution almost surely. We then establish a sufficient condition to ensure the required separability of the classifier, and provide some converse results to investigate the role of the size of the offline dataset and the family of classifiers among classifier-based tests that satisfy the level-$α$ power-one criterion. Finally, we present an extension of our analysis for the training-and-testing distribution mismatch and illustrate an application to sequential change detection. Empirical results using both synthetic and real data provide support for our theoretical results.

翻译：本文考虑构造序贯幂1检验的问题，其中零假设和备择类通过历史数据或离线数据间接指定。具体而言，给定由$L+1$个分布$\{P_0, P_1, \ldots, P_L\}$的观测值组成的离线数据集，以及一个未标注的新数据流$\{X_t: t \geq 1\} \overset{i.i.d}{\sim} P_θ$，目标是判定零假设$H_0: θ= 0$与备择假设$H_1: θ\in [L]:=\{1,\ldots,L\}$。我们的主要方法学贡献是提出一种通用方法，利用在给定离线数据集上训练的多分类器设计该问题的水平-α幂1检验。在分布与训练分类器满足温和的“可分离性”条件时，我们得到了所提水平-α检验期望停止时间的上界，并证明了该上界在一般情况下不可改进。除拒绝零假设外，我们还证明本方法能以概率1识别真实分布。随后，我们建立了确保分类器所需可分离性的充分条件，并给出部分逆结果以探究离线数据集规模与分类器族在满足水平-α幂1准则的基于分类器检验中的作用。最后，我们扩展分析了训练-测试分布失配情况，并展示了在序贯变化检测中的应用。基于合成数据与实际数据的实证结果支持了我们的理论结论。

相关内容

分类器

关注 6

分类是数据挖掘的一种非常重要的方法。分类的概念是在已有数据的基础上学会一个分类函数或构造出一个分类模型（即我们通常所说的分类器(Classifier)）。该函数或模型能够把数据库中的数据纪录映射到给定类别中的某一个，从而可以应用于数据预测。总之，分类器是数据挖掘中对样本进行分类的方法的统称，包含决策树、逻辑回归、朴素贝叶斯、神经网络等算法。

分布外如何检测？东大等最新《视觉语言模型时代的广义异常检测及其拓展》综述

专知会员服务

25+阅读 · 2024年8月2日

【NeurIPS2023】半监督端到端对比学习用于时间序列分类

专知会员服务

37+阅读 · 2023年10月17日

弹药异常检测《使用机器学习进行缺陷表征》最佳论文，MODSIM World 2023

专知会员服务

37+阅读 · 2023年7月22日

【剑桥大学博士论文】模型不确定性下的统计假设检验，198页pdf

专知会员服务

26+阅读 · 2023年2月7日