A Probably Approximately Correct Analysis of Group Testing Algorithms

We consider the problem of identifying the defectives from a population of items via a non-adaptive group testing framework with a random pooling-matrix design. We analyze the sufficient number of tests needed for approximate set identification, i.e., for identifying almost all the defective and non-defective items with high confidence. To this end, we view the group testing problem as a function learning problem and develop our analysis using the probably approximately correct (PAC) framework. Using this formulation, we derive sufficiency bounds on the number of tests for three popular binary group testing algorithms: column matching, combinatorial basis pursuit, and definite defectives. We compare the derived bounds with the existing ones in the literature for exact recovery theoretically and using simulations. Finally, we contrast the three group testing algorithms under consideration in terms of the sufficient testing rate surface and the sufficient number of tests contours across the range of the approximation and confidence levels.

翻译：本文研究在随机池化矩阵设计的非自适应群体测试框架下，从物品总体中识别缺陷品的问题。我们分析了近似集合识别所需的充分测试次数，即在高置信度下识别几乎所有缺陷品与非缺陷品所需的条件。为此，我们将群体测试问题视为函数学习问题，并基于概率近似正确（PAC）框架展开分析。通过该形式化方法，我们推导了三种主流二元群体测试算法所需测试次数的充分性界：列匹配算法、组合基追踪算法及确定缺陷品算法。我们将所得理论界与文献中现有精确恢复的界进行了理论比较与仿真验证。最后，我们从近似度与置信度参数范围内的充分测试率曲面和充分测试次数等值线两个维度，对比了所研究的三种群体测试算法的性能。

相关内容

GROUP

关注 1

Group一直是研究计算机支持的合作工作、人机交互、计算机支持的协作学习和社会技术研究的主要场所。该会议将社会科学、计算机科学、工程、设计、价值观以及其他与小组工作相关的多个不同主题的工作结合起来，并进行了广泛的概念化。官网链接：https://group.acm.org/conferences/group20/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日