Language models (LMs) that jointly generate end-task answers as well as free-text rationales are known as self-rationalization models. Recent works demonstrate great performance gain for self-rationalization by few-shot prompting LMs with rationale-augmented exemplars. However, the ability to benefit from explanations only emerges with large-scale LMs, which have poor accessibility. In this work, we explore the less-studied setting of leveraging explanations for small LMs to improve few-shot self-rationalization. We first revisit the relationship between rationales and answers. Inspired by the implicit mental process of how human beings assess explanations, we present a novel approach, Zero-shot Augmentation of Rationale-Answer pairs (ZARA), to automatically construct pseudo-parallel data for self-training by reducing the problem of plausibility judgement to natural language inference. Experimental results show ZARA achieves SOTA performance on the FEB benchmark, for both the task accuracy and the explanation metric. In addition, we conduct human and quantitative evaluation validating ZARA's ability to automatically identify plausible and accurate rationale-answer pairs.
翻译:语言模型(LM)若同时生成端任务答案以及自由文本解释,则被称为自解释模型。近期研究表明,通过少样本提示结合解释增强示例,自解释模型性能显著提升。然而,从解释中获益的能力仅在大规模LM中涌现,这类模型可访问性差。本文探索了利用解释改善小型语言模型少样本自解释这一较少研究的场景。我们首先重新审视解释与答案的关系。受人类评估解释时潜在心理过程的启发,我们提出一种新方法——零样本增强解释-答案对(ZARA),通过将合理性判定问题简化为自然语言推理,自动构建用于自训练的伪平行数据。实验结果表明,ZARA在FEB基准测试中,任务准确率和解释指标均达到最优性能。此外,我们通过人工与定量评估验证了ZARA自动识别合理且准确的解释-答案对的能力。