Natural Language Inference (NLI) tasks require identifying the relationship between sentence pairs, typically classified as entailment, contradiction, or neutral. While the current state-of-the-art (SOTA) model, Entailment Few-Shot Learning (EFL), achieves 93.1% accuracy on the Stanford Natural Language Inference (SNLI) dataset, further advancements are constrained by the dataset's limitations. To address this, we propose a novel approach that uses synthetic data augmentation to enhance dataset diversity and complexity. We present UnitedSynT5, an extension of EFL that employs a T5-based generator to synthesize additional premise-hypothesis pairs, which are rigorously cleaned and integrated into the training data. These augmented examples are processed within the EFL framework, with labels embedded directly into the hypotheses for consistency. We train a GTR-T5-XL model on this expanded dataset, achieving new benchmarks of 94.7% accuracy on SNLI, 94.01% on E-SNLI, and 92.57% on MultiNLI, surpassing the previous SOTA models. This work demonstrates the potential of synthetic data augmentation for improving NLI models and offers a path toward further advances in natural language understanding.
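The label-embedding step described above can be sketched as a small preprocessing function that folds each NLI label into its hypothesis before training. The template wording below is an assumption for illustration, not necessarily the exact format used by EFL or UnitedSynT5.

```python
# Sketch of embedding NLI labels directly into hypotheses, in the spirit
# of the EFL framework. Template phrasing is illustrative (an assumption),
# not the paper's verbatim format.

LABEL_TEMPLATES = {
    "entailment":    "{hypothesis} This statement is true.",
    "contradiction": "{hypothesis} This statement is false.",
    "neutral":       "{hypothesis} This statement may or may not be true.",
}

def embed_label(premise: str, hypothesis: str, label: str) -> dict:
    """Return a training example with the label folded into the hypothesis."""
    if label not in LABEL_TEMPLATES:
        raise ValueError(f"unknown label: {label}")
    return {
        "premise": premise,
        "hypothesis": LABEL_TEMPLATES[label].format(hypothesis=hypothesis),
    }

example = embed_label(
    "A man is playing a guitar on stage.",
    "A musician is performing.",
    "entailment",
)
print(example["hypothesis"])
```

In this formulation, synthetic premise-hypothesis pairs produced by the generator can pass through the same function, keeping augmented and original examples in a consistent format for the downstream GTR-T5-XL training run.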