We propose a neuralized undirected graphical model called Neural-Hidden-CRF to solve the weakly-supervised sequence labeling problem. Under the umbrella of probabilistic undirected graph theory, the proposed Neural-Hidden-CRF embedded with a hidden CRF layer models the variables of word sequence, latent ground truth sequence, and weak label sequence with the global perspective that undirected graphical models particularly enjoy. In Neural-Hidden-CRF, we can capitalize on the powerful language model BERT or other deep models to provide rich contextual semantic knowledge to the latent ground truth sequence, and use the hidden CRF layer to capture the internal label dependencies. Neural-Hidden-CRF is conceptually simple and empirically powerful. It obtains new state-of-the-art results on one crowdsourcing benchmark and three weak-supervision benchmarks, including outperforming the recent advanced model CHMM by 2.80 F1 points and 2.23 F1 points in average generalization and inference performance, respectively.
翻译:我们提出一种名为神经-隐条件随机场的神经化无向图模型,用于解决弱监督序列标注问题。在概率无向图理论框架下,所提出的神经-隐条件随机场通过嵌入隐条件随机场层,以无向图模型特有的全局视角对词语序列、潜在真实标签序列和弱标签序列进行建模。在神经-隐条件随机场中,我们可利用强大的语言模型BERT或其他深度模型为潜在真实标签序列提供丰富的上下文语义知识,并通过隐条件随机场层捕获内部标签依赖关系。神经-隐条件随机场概念简单且经验性能强大。该方法在一个众包基准测试和三个弱监督基准测试中取得了新的最优结果,包括在平均泛化性能和推理性能上分别超越近期先进模型CHMM达2.80个F1分数点和2.23个F1分数点。