AlignXIE: Improving Multilingual Information Extraction by Cross-Lingual Alignment

Empirical evidence suggests that LLMs exhibit spontaneous cross-lingual alignment. Our findings suggest that although LLMs also demonstrate promising cross-lingual alignment in Information Extraction, there remains significant imbalance across languages, revealing an underlying deficiency in the IE alignment. To address this issue, we propose AlignXIE, a powerful code-based LLM that significantly enhances cross-lingual IE alignment through two strategies. Firstly, AlignXIE formulates IE across different languages, especially non-English ones, as code generation tasks, standardizing the representation of various schemas using Python classes to ensure consistency of the same ontology in different languages and align the schema. Secondly, it incorporates an IE cross-lingual alignment phase through a translated instance prediction task proposed in this paper to align the extraction process, utilizing ParallelNER, an IE bilingual parallel dataset with 257,190 samples, generated by our proposed LLM-based automatic pipeline for IE parallel data construction, with manual annotation to ensure quality. Ultimately, we obtain AlignXIE through multilingual IE instruction tuning. Although without training in 9 unseen languages, AlignXIE surpasses ChatGPT by $30.17\%$ and SoTA by $20.03\%$, thereby demonstrating superior cross-lingual IE capabilities. Comprehensive evaluations on 63 IE benchmarks in Chinese and English under various settings, demonstrate that AlignXIE significantly enhances cross-lingual and multilingual IE through boosting the IE alignment.

翻译：实证研究表明，大语言模型展现出自主的跨语言对齐能力。我们的发现表明，尽管大语言模型在信息抽取任务中也表现出有前景的跨语言对齐特性，但不同语言间仍存在显著的不平衡，揭示了信息抽取对齐的内在不足。为解决这一问题，我们提出了AlignXIE——一个基于代码的强大大语言模型，它通过两种策略显著增强了跨语言信息抽取对齐。首先，AlignXIE将不同语言（尤其是非英语语言）的信息抽取任务形式化为代码生成任务，利用Python类标准化各类模式的表示，以确保相同本体在不同语言中的一致性并对齐模式定义。其次，模型引入了一个跨语言信息抽取对齐阶段，通过本文提出的翻译实例预测任务来对齐抽取过程；该阶段使用了ParallelNER（一个包含257,190个样本的信息抽取双语平行数据集），该数据集由我们提出的大语言模型驱动的自动平行数据构建流程生成，并辅以人工标注以确保质量。最终，我们通过多语言信息抽取指令微调获得了AlignXIE。尽管未在9种未见语言上进行训练，AlignXIE的性能仍超越ChatGPT达30.17%，并优于当前最优模型20.03%，从而展现出卓越的跨语言信息抽取能力。在中文和英文的63个信息抽取基准上的全面评估表明，AlignXIE通过提升信息抽取对齐度，显著增强了跨语言及多语言信息抽取性能。