Few-shot sequence labeling aims to identify novel classes based on only a few labeled samples. Existing methods solve the data scarcity problem mainly by designing token-level or span-level labeling models based on metric learning. However, these methods are only trained at a single granularity (i.e., either token level or span level) and have some weaknesses of the corresponding granularity. In this paper, we first unify token and span level supervisions and propose a Consistent Dual Adaptive Prototypical (CDAP) network for few-shot sequence labeling. CDAP contains the token-level and span-level networks, jointly trained at different granularities. To align the outputs of two networks, we further propose a consistent loss to enable them to learn from each other. During the inference phase, we propose a consistent greedy inference algorithm that first adjusts the predicted probability and then greedily selects non-overlapping spans with maximum probability. Extensive experiments show that our model achieves new state-of-the-art results on three benchmark datasets.
翻译:少样本序列标注旨在仅通过少量标注样本识别新类别。现有方法主要通过设计基于度量学习的令牌级或跨度级标注模型来解决数据稀缺问题。然而,这些方法仅在单一粒度(即令牌级或跨度级)下训练,存在对应粒度的局限性。本文首次统一令牌级与跨度级监督,提出一致双重自适应原型网络(CDAP)用于少样本序列标注。CDAP包含令牌级与跨度级网络,以不同粒度联合训练。为实现两个网络输出对齐,我们进一步提出一致性损失函数,使其能够相互学习。在推理阶段,我们提出一致性贪婪推理算法,先调整预测概率,再贪婪选择概率最大的非重叠跨度。大量实验表明,我们的模型在三个基准数据集上取得了新的最优结果。