On Bilingual Lexicon Induction with Large Language Models

Bilingual Lexicon Induction (BLI) is a core task in multilingual NLP that still, to a large extent, relies on calculating cross-lingual word representations. Inspired by the global paradigm shift in NLP towards Large Language Models (LLMs), we examine the potential of the latest generation of LLMs for the development of bilingual lexicons. We ask the following research question: Is it possible to prompt and fine-tune multilingual LLMs (mLLMs) for BLI, and how does this approach compare against and complement current BLI approaches? To this end, we systematically study 1) zero-shot prompting for unsupervised BLI and 2) few-shot in-context prompting with a set of seed translation pairs, both without any LLM fine-tuning, as well as 3) standard BLI-oriented fine-tuning of smaller LLMs. We experiment with 18 open-source text-to-text mLLMs of different sizes (from 0.3B to 13B parameters) on two standard BLI benchmarks covering a range of typologically diverse languages. Our work is the first to demonstrate strong BLI capabilities of text-to-text mLLMs. The results reveal that few-shot prompting with in-context examples from nearest neighbours achieves the best performance, establishing new state-of-the-art BLI scores for many language pairs. We also conduct a series of in-depth analyses and ablation studies, providing more insights on BLI with (m)LLMs, also along with their limitations.

翻译：双语词汇归纳（BLI）是多语言自然语言处理中的核心任务，至今仍很大程度上依赖于跨语言词向量的计算。受自然语言处理领域向大型语言模型（LLMs）全球性范式转变的启发，我们探究了最新一代LLMs在构建双语词汇表方面的潜力。我们提出以下研究问题：能否通过提示工程和微调多语言LLMs（mLLMs）来完成BLI任务？这种方法与现有BLI方法相比如何，又能否形成互补？为此，我们系统研究了：1）面向无监督BLI的零样本提示方法；2）基于种子翻译对集合的少样本上下文内提示方法（两者均未涉及LLM微调）；3）面向标准BLI任务的小型LLM微调。我们采用18个不同参数量（3亿至130亿）的开源文本到文本mLLMs，在覆盖多种类型学语言的两个标准BLI基准上开展实验。本研究首次证明了文本到文本mLLMs具备强大的BLI能力。结果表明，采用最近邻上下文示例的少样本提示方法取得了最佳性能，为众多语言对树立了新的BLI评分标杆。我们还通过一系列深度分析和消融实验，从（多语言）LLMs视角为BLI提供了更深入的见解，同时指出了其局限性。

相关内容

大语言模型

关注 67

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日