An Information-Spectrum Approach to Distributed Hypothesis Testing for General Sources

This paper investigates Distributed Hypothesis testing (DHT), in which a source $\mathbf{X}$ is encoded given that side information $\mathbf{Y}$ is available at the decoder only. Based on the received coded data, the receiver aims to decide on the two hypotheses $H_0$ or $H_1$ related to the joint distribution of $\mathbf{X}$ and $\mathbf{Y}$. While most existing contributions in the literature on DHT consider i.i.d. assumptions, this paper assumes more generic, non-i.i.d., non-stationary, and non-ergodic sources models. It relies on information-spectrum tools to provide general formulas on the achievable Type-II error exponent under a constraint on the Type-I error. The achievability proof is based on a quantize-and-binning scheme. It is shown that with the quantize-and-binning approach, the error exponent boils down to a trade-off between a binning error and a decision error, as already observed for the i.i.d. sources. The last part of the paper provides error exponents for particular source models, \emph{e.g.}, Gaussian, stationary, and ergodic models.

翻译：本文研究分布式假设检验问题，其中信源 $\mathbf{X}$ 在译码器仅可获取边信息 $\mathbf{Y}$ 的条件下进行编码。接收端根据接收的编码数据，旨在判定与 $\mathbf{X}$ 和 $\mathbf{Y}$ 联合分布相关的两种假设 $H_0$ 或 $H_1$。现有文献中大多数分布式假设检验的研究均假设独立同分布情形，而本文则考虑更具一般性的非独立同分布、非平稳及非遍历信源模型。本文利用信息谱工具，在给定第一类错误约束条件下，给出了可达第二类错误指数的通用公式。可达性证明基于量化-分箱方案。研究表明，采用量化-分箱方法时，错误指数可归结为分箱错误与决策错误之间的权衡，这一现象在独立同分布信源中已被观察到。文章最后部分给出了特定信源模型（如高斯、平稳及遍历模型）的错误指数。

相关内容

分布式哈希表技术

关注 0

分布式哈希表技术(Distributed Hash Table)简称DHT,类似Tracker的根据种子特征码返回种子信息的网络·是一种分布式存储方法。在不需要服务器的情况下，每个客户端负责一个小范围的路由，并负责存储一小部分数据，从而实现整个DHT网络的寻址和存储。新版BitComet允许同行连接DHT网络和Tracker，也就是说在完全不连上[Tracker服务器的情况下，也可以很好的下载，因为它可以在DHT网络中寻找下载同一文件的其他用户。BitComet的DHT网络协议和BitTorrent今年5月测试版的协议完全兼容，也就是说可以连入一个同DHT网络分享数据。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日