Inference, especially those derived from inductive processes, is a crucial component in our conversation to complement the information implicitly or explicitly conveyed by a speaker. While recent large language models show remarkable advances in inference tasks, their performance in inductive reasoning, where not all information is present in the context, is far behind deductive reasoning. In this paper, we analyze the behavior of the models based on the task difficulty defined by the semantic information gap -- which distinguishes inductive and deductive reasoning (Johnson-Laird, 1988, 1993). Our analysis reveals that the disparity in information between dialogue contexts and desired inferences poses a significant challenge to the inductive inference process. To mitigate this information gap, we investigate a contrastive learning approach by feeding negative samples. Our experiments suggest negative samples help models understand what is wrong and improve their inference generations.
翻译:推理,尤其是源自归纳过程的推理,是我们在对话中补充说话者隐含或明确表达信息的关键组成部分。尽管近期大型语言模型在推理任务上展现出显著进步,但它们在归纳推理(即上下文未提供全部信息)方面的表现远逊于演绎推理。本文基于语义信息差(Johnson-Laird, 1988, 1993)定义的任务难度,分析了模型的行为特征。我们的分析表明,对话语境与期望推理之间的信息差异对归纳推理过程构成了重大挑战。为缓解这一信息差,我们研究了一种通过输入负样本的对比学习方法。实验表明,负样本有助于模型理解错误所在,并改善其推理生成质量。