Natural Language Inference (NLI) tasks require identifying the relationship between sentence pairs, typically classified as entailment, contradiction, or neutral. While the current state-of-the-art (SOTA) model, Entailment Few-Shot Learning (EFL), achieves 93.1% accuracy on the Stanford Natural Language Inference (SNLI) dataset, further advances are constrained by the dataset's limitations. To address this, we propose a novel approach that leverages synthetic data augmentation to increase dataset diversity and complexity. We present UnitedSynT5, an extension of EFL that uses a T5-based generator to synthesize additional premise-hypothesis pairs, which are rigorously cleaned and integrated into the training data. These augmented examples are processed within the EFL framework, embedding labels directly into the hypotheses for consistency. We train a GTR-T5-XL model on this expanded dataset, achieving a new benchmark of 94.7% accuracy on SNLI, 94.0% on E-SNLI, and 92.6% on MultiNLI, surpassing the previous SOTA models. This research demonstrates the potential of synthetic data augmentation for improving NLI models, offering a path toward further advances in natural language understanding tasks.
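The EFL-style label embedding mentioned above can be illustrated with a minimal sketch: each (premise, hypothesis, label) triple is recast as binary entailment decisions by folding a label description into the hypothesis text. The template wording and function names here are illustrative assumptions, not the paper's exact prompts.

```python
# Hypothetical sketch of EFL-style reformulation: the three-way NLI
# label is embedded into the hypothesis via a verbal template, turning
# classification into binary entail / not-entail decisions.
# Templates below are assumptions for illustration only.

LABEL_TEMPLATES = {
    "entailment": "This implies that {h}",
    "contradiction": "This contradicts the claim that {h}",
    "neutral": "This is undetermined with respect to whether {h}",
}

def embed_label(premise: str, hypothesis: str, gold_label: str):
    """Return (input_text, binary_target) pairs, one per candidate label.

    Exactly one pair (the gold label's) carries target 1.
    """
    examples = []
    for candidate, template in LABEL_TEMPLATES.items():
        text = premise + " " + template.format(h=hypothesis)
        examples.append((text, 1 if candidate == gold_label else 0))
    return examples

pairs = embed_label("A man plays guitar.", "a man is making music.", "entailment")
```

Under this reformulation, the synthetic premise-hypothesis pairs produced by the T5 generator can be dropped into the same binary training pipeline without a separate label head.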