Frontiers to the learning of nonparametric hidden Markov models

Hidden Markov models (HMMs) are flexible tools for clustering dependent data coming from unknown populations, allowing nonparametric modelling of the population densities. Identifiability fails when the data is in fact independent, and we study the frontier between learnable and unlearnable two-state nonparametric HMMs. Interesting new phenomena emerge when the cluster distributions are modelled via density functions (the 'emission' densities) belonging to standard smoothness classes compared to the multinomial setting. Notably, in contrast to the multinomial setting previously considered, the identification of a direction separating the two emission densities becomes a critical, and challenging, issue. Surprisingly, it is possible to "borrow strength" from estimators of the smoother density to improve estimation of the other. We conduct precise analysis of minimax rates, showing a transition depending on the relative smoothnesses of the emission densities.

翻译：隐马尔可夫模型（HMMs）是用于对来自未知总体的相依数据进行聚类的灵活工具，允许对总体密度进行非参数建模。当数据实际独立时，模型的可识别性失效，本文研究了可学习与不可学习的两状态非参数HMMs之间的边界。与多项分布情形相比，当聚类分布通过属于标准光滑类别的密度函数（即"发射"密度）进行建模时，出现了有趣的新现象。值得注意的是，与先前考虑的多项分布情形不同，识别区分两个发射密度的方向成为一个关键且具有挑战性的问题。令人惊讶的是，可以利用较光滑密度估计器的"强度借用"来改进另一个密度的估计。我们对极小化极大速率进行了精确分析，展示了依赖于发射密度相对光滑度的转变。

相关内容

隐马尔科夫模型

关注 18

隐马儿可夫模型：HMM，hidden Markov model，是可用于标注问题的统计学习模型，描述由隐藏的马尔可夫链随机生成观测序列的过程，属于生成模型。隐马尔可夫模型是关于时序的概率模型，描述由一个隐藏的马尔可夫链随机生成不可观测的状态随机序列，再有各个状态生成一个观测而产生观测随机序列的过程。隐藏的马尔可夫链随机生成的状态的序列，称为状态序列。每个状态生成一个观测，而由此产生的观测的随机序列，称为观测序列。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日