Events refer to specific occurrences, incidents, or happenings that take place under a particular background. Event reasoning aims to infer events according to certain relations and predict future events. The cutting-edge techniques for event reasoning play a crucial role in various natural language processing applications. Large language models (LLMs) have made significant advancements in event reasoning owing to their wealth of knowledge and reasoning capabilities. However, smaller instruction-tuned models currently in use do not consistently demonstrate exceptional proficiency in managing these tasks. This discrepancy arises from the absence of explicit modeling of events and the interconnections of them within their instruction data. Consequently, these models face challenges in comprehending event structures and semantics while struggling to bridge the gap between their interpretations and human understanding of events. Additionally, their limitations in grasping event relations lead to constrained event reasoning abilities to effectively deduce and incorporate pertinent event knowledge. In this paper, we propose Event-Oriented Instruction Tuning (EvIT) to train our LLM. Specifically, we first propose a novel structure named event quadruple which contains the structure and semantics of events and is complete in the event representation. We then design event-relation learning based on the structures. We encapsulate the learning into the instruction-tuning formulation to better stimulate the event reasoning capacity of our model. We design a heuristic unsupervised method to mine event quadruple from a large-scale corpus. At last, we finetune a Llama model on our Event-Oriented Instruction Tuning. We conduct extensive experiments on event reasoning tasks on several datasets. Automatic and human evaluations demonstrate EvIT achieves competitive performances on event reasoning.
翻译:事件是指在特定背景下发生的具体事态、事件或情况。事件推理旨在根据特定关系推断事件并预测未来事件。事件推理的前沿技术在各种自然语言处理应用中发挥着至关重要的作用。大语言模型凭借其丰富的知识与推理能力,在事件推理领域取得了显著进展。然而,当前使用的小型指令微调模型在管理这些任务时并未始终展现出卓越的胜任能力。这种差异源于其指令数据中缺乏对事件及其相互关联的显式建模。因此,这些模型在理解事件结构与语义方面面临挑战,难以弥合其解释与人类对事件理解之间的差距。此外,它们在掌握事件关系方面的局限性导致事件推理能力受限,无法有效推演并整合相关事件知识。本文提出事件导向指令微调方法以训练大语言模型。具体而言,我们首先提出一种名为事件四元组的新颖结构,该结构包含事件的结构与语义信息,且事件表示完整。随后基于该结构设计事件关系学习机制,并将其封装至指令微调框架中,以更有效地激发模型的事件推理能力。我们设计了一种启发式无监督方法,从大规模语料库中挖掘事件四元组。最后,基于事件导向指令微调对Llama模型进行微调。我们在多个数据集上对事件推理任务进行了广泛实验。自动评估与人工评估表明,EvIT在事件推理任务上取得了具有竞争力的性能。