Semantic Frame Induction with Deep Metric Learning

Recent studies have demonstrated the usefulness of contextualized word embeddings in unsupervised semantic frame induction. However, they have also revealed that generic contextualized embeddings are not always consistent with human intuitions about semantic frames, which causes unsatisfactory performance for frame induction based on contextualized embeddings. In this paper, we address supervised semantic frame induction, which assumes the existence of frame-annotated data for a subset of predicates in a corpus and aims to build a frame induction model that leverages the annotated data. We propose a model that uses deep metric learning to fine-tune a contextualized embedding model, and we apply the fine-tuned contextualized embeddings to perform semantic frame induction. Our experiments on FrameNet show that fine-tuning with deep metric learning considerably improves the clustering evaluation scores, namely, the B-cubed F-score and Purity F-score, by about 8 points or more. We also demonstrate that our approach is effective even when the number of training instances is small.

翻译：近期的研究表明，上下文词嵌入在无监督语义框架归纳中具有实用价值。然而，这些研究也揭示出通用上下文嵌入并不总是与人类对语义框架的直觉保持一致，这导致基于上下文嵌入的框架归纳性能不尽人意。本文研究有监督语义框架归纳问题，该任务假设语料库中部分谓词存在框架标注数据，旨在构建能够利用这些标注数据的框架归纳模型。我们提出了一种采用深度度量学习微调上下文嵌入模型的方案，并应用微调后的上下文嵌入进行语义框架归纳。在FrameNet上的实验表明，深度度量学习微调显著提升了聚类评估指标——B立方F分数和纯度F分数提升约8个百分点以上。我们还证明即使在训练实例数量较少的情况下，该方法依然有效。

相关内容

度量学习

关注 3379

度量学习的目的为了衡量样本之间的相近程度，而这也正是模式识别的核心问题之一。大量的机器学习方法，比如K近邻、支持向量机、径向基函数网络等分类方法以及K-means聚类方法，还有一些基于图的方法，其性能好坏都主要有样本之间的相似度量方法的选择决定。度量学习通常的目标是使同类样本之间的距离尽可能缩小，不同类样本之间的距离尽可能放大。

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【图深度学习GDL论文大全】A comprehensive collection of recent papers on graph deep learning

专知会员服务

47+阅读 · 2019年12月1日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日