Improving LLM-based Machine Translation with Systematic Self-Correction

Large Language Models (LLMs) have achieved impressive results in Machine Translation (MT). However, careful evaluations by human reveal that the translations produced by LLMs still contain multiple errors. Importantly, feeding back such error information into the LLMs can lead to self-correction and result in improved translation performance. Motivated by these insights, we introduce a systematic LLM-based self-correcting translation framework, named TER, which stands for Translate, Estimate, and Refine, marking a significant step forward in this direction. Our findings demonstrate that 1) our self-correction framework successfully assists LLMs in improving their translation quality across a wide range of languages, whether it's from high-resource languages to low-resource ones or whether it's English-centric or centered around other languages; 2) TER exhibits superior systematicity and interpretability compared to previous methods; 3) different estimation strategies yield varied impacts on AI feedback, directly affecting the effectiveness of the final corrections. We further compare different LLMs and conduct various experiments involving self-correction and cross-model correction to investigate the potential relationship between the translation and evaluation capabilities of LLMs.

翻译：大语言模型（LLMs）在机器翻译（MT）领域取得了令人瞩目的成果。然而，人工精细评估显示，LLMs生成的翻译仍包含多种错误。重要的是，将这些错误信息反馈给LLMs可触发自我修正，从而提升翻译性能。受此启发，我们提出了一种基于LLM的系统性自我修正翻译框架TER（即翻译、评估与精炼的英文缩写），标志着该方向的重要进展。研究发现：1）我们的自我修正框架成功帮助LLMs提升各类语言对的翻译质量，无论是高资源语言到低资源语言，还是以英语为中心或其他语言为中心的场景；2）与先前方法相比，TER展现出更强的系统性和可解释性；3）不同评估策略对AI反馈产生差异化影响，直接决定最终修正效果。我们进一步比较了不同LLMs，并通过自我修正与跨模型修正的系列实验，探究LLMs翻译能力与评估能力之间的潜在关联。

相关内容

Machine Translation

关注 210

机器翻译（Machine Translation）涵盖计算语言学和语言工程的所有分支，包含多语言方面。特色论文涵盖理论，描述或计算方面的任何下列主题:双语和多语语料库的编写和使用，计算机辅助语言教学，非罗马字符集的计算含义，连接主义翻译方法，对比语言学等。官网地址：http://dblp.uni-trier.de/db/journals/mt/

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

【WSDM2020】超越统计关系：将知识关系整合到多标签音乐风格分类的风格关联中（附pdf）

专知会员服务

18+阅读 · 2019年11月23日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日