Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis - 专知论文

会员服务 ·

0

Performer · Machine Translation · Analysis · MoDELS · 过估计 ·

2023 年 5 月 2 日

Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis

翻译：多语言机器翻译与大语言模型：实证结果与分析

Wenhao Zhu,Hongyi Liu,Qingxiu Dong,Jingjing Xu,Shujian Huang,Lingpeng Kong,Jiajun Chen,Lei Li

Large language models (LLMs) have demonstrated remarkable potential in handling multilingual machine translation (MMT). In this paper, we systematically investigate the advantages and challenges of LLMs for MMT by answering two questions: 1) How well do LLMs perform in translating a massive number of languages? 2) Which factors affect LLMs' performance in translation? We evaluate popular LLMs, including XGLM, OPT, BLOOMZ, and ChatGPT, on 102 languages. Our empirical results show that even the best model ChatGPT still lags behind the supervised baseline NLLB in 83.33% of translation directions. Through further analysis, we discover that LLMs exhibit new working patterns when used for MMT. First, prompt semantics can surprisingly be ignored when given in-context exemplars, where LLMs still show strong performance even with unreasonable prompts. Second, cross-lingual exemplars can provide better task instruction for low-resource translation than exemplars in the same language pairs. Third, we observe the overestimated performance of BLOOMZ on dataset Flores-101, indicating the potential risk when using public datasets for evaluation.

翻译：大语言模型在处理多语言机器翻译方面展现出显著潜力。本文通过回答两个问题系统探究了大语言模型在多语言机器翻译中的优势与挑战：1）大语言模型在翻译海量语言时的表现如何？2）哪些因素影响大语言模型的翻译性能？我们评估了XGLM、OPT、BLOOMZ和ChatGPT等主流大语言模型在102种语言上的表现。实证结果表明，即使是最优模型ChatGPT，在83.33%的翻译方向上仍落后于有监督基线模型NLLB。通过进一步分析，我们发现大语言模型在多语言机器翻译中呈现出新的工作模式：其一，在给定上下文示例时，提示语义可被意外忽略——即便使用不合理提示，大语言模型仍展现强劲性能；其二，跨语言示例相比同语言对示例能为低资源翻译提供更优任务指令；其三，我们观察到BLOOMZ在Flores-101数据集上的性能被高估，这揭示了使用公开数据集进行评估时的潜在风险。

0

相关内容

Performer

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

246+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

80+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

专知

20+阅读 · 2018年6月29日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

微波法合成α-Fe2O3单晶纳米线及其在太阳能光解水电池中的应用研究

国家自然科学基金

0+阅读 · 2015年12月31日

金属化含能材料中金属自钝化及界面反应机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

表面等离子体共振增强的新型高效太阳能电池

国家自然科学基金

0+阅读 · 2013年12月31日

基于能量传递的宽谱调制近红外下转换纳米发光薄膜研究

国家自然科学基金

0+阅读 · 2012年12月31日

表面等离子激元吸收增强型太阳能电池机理的超快光谱研究

国家自然科学基金

0+阅读 · 2012年12月31日

松友饮对肝动脉化疗栓塞治疗后残癌细胞上皮间质转化的影响及其分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

结合硝化反应和超临界流体萃取的高温气冷堆燃料元件中UO2芯球的处理

国家自然科学基金

0+阅读 · 2010年12月31日

大承气汤调控AR42J细胞凋亡-坏死转换分子开关的转化研究

国家自然科学基金

0+阅读 · 2009年12月31日

二甲醚催化水汽重整产氢原位低温还原NOx的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Nrf2-ARE通路在缺血/药物后处理中作用的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Understanding Deep Generative Models with Generalized Empirical Likelihoods

Arxiv

0+阅读 · 2023年6月16日

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

Arxiv

0+阅读 · 2023年6月15日

LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models

Arxiv

0+阅读 · 2023年6月15日

Opportunities for Large Language Models and Discourse in Engineering Design

Arxiv

0+阅读 · 2023年6月15日

Reducing Hallucinations in Neural Machine Translation with Feature Attribution

Arxiv

0+阅读 · 2023年6月14日

COVER: A Heuristic Greedy Adversarial Attack on Prompt-based Learning in Language Models

Arxiv

0+阅读 · 2023年6月14日

Towards Reasoning in Large Language Models: A Survey

Arxiv

34+阅读 · 2022年12月20日

An Overview on Machine Translation Evaluation

An Overview on Machine Translation Evaluation

Arxiv

14+阅读 · 2022年2月22日

CoDEx: A Comprehensive Knowledge Graph Completion Benchmark

Arxiv

10+阅读 · 2020年10月6日

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond

Arxiv

15+阅读 · 2020年5月13日

VIP会员

文章信息

相关主题

Machine Translation

最新内容

从领域适配到部署与可解释：Berkeley博士论文解析大语言模型真实落地

从领域适配到部署与可解释：Berkeley博士论文解析大语言模型真实落地

专知会员服务

0+阅读 · 6分钟前

综述 | 长程智能体研究全景：基础、演化、框架、优化与前沿

综述 | 长程智能体研究全景：基础、演化、框架、优化与前沿

专知会员服务

0+阅读 · 11分钟前

DARPA拟打造十万规模自主思考作战的AI智能体集群：“受控涌现式分布式人工智能”（DICE）项目

DARPA拟打造十万规模自主思考作战的AI智能体集群：“受控涌现式分布式人工智能”（DICE）项目

专知会员服务

4+阅读 · 7月17日

《边缘端实时无线感知赋能现场多机器人部署》200页

《边缘端实时无线感知赋能现场多机器人部署》200页

专知会员服务

5+阅读 · 7月17日

战力倍增器：自主武器系统与乌克兰及加沙冲突

战力倍增器：自主武器系统与乌克兰及加沙冲突

专知会员服务

4+阅读 · 7月17日

人工智能赋能战场情报：提速决策进程

人工智能赋能战场情报：提速决策进程

专知会员服务

2+阅读 · 7月17日

《拥抱新兴技术：面向未来军官的教育革新》

《拥抱新兴技术：面向未来军官的教育革新》

专知会员服务

5+阅读 · 7月17日

ACM MM 2026 | MAR-GRPO：稳定混合图像生成的强化学习训练

ACM MM 2026 | MAR-GRPO：稳定混合图像生成的强化学习训练

专知会员服务

2+阅读 · 7月17日

综述 | 大模型水印理论与部署：来源追踪、攻击鲁棒与可信治理

综述 | 大模型水印理论与部署：来源追踪、攻击鲁棒与可信治理

专知会员服务

3+阅读 · 7月17日

《火线上的后勤保障：对抗环境下的随机规划模型研究——俄乌场景案例分析》99页

《火线上的后勤保障：对抗环境下的随机规划模型研究——俄乌场景案例分析》99页

专知会员服务

11+阅读 · 7月16日

《无人地面战车（UGV）的崛起》报告

《无人地面战车（UGV）的崛起》报告

专知会员服务

7+阅读 · 7月16日

《无人机参数化与集群飞行创新项目的监控流程管理：模型、策略及自适应解决方案》

《无人机参数化与集群飞行创新项目的监控流程管理：模型、策略及自适应解决方案》

专知会员服务

6+阅读 · 7月16日

《美军开放式任务系统（OMS）定义与文档（D&D）——Java关键抽象层（CAL）接口生成规范》47页标准

《美军开放式任务系统（OMS）定义与文档（D&D）——Java关键抽象层（CAL）接口生成规范》47页标准

专知会员服务

13+阅读 · 7月16日

美陆军任务式指挥人工智能解决方案

美陆军任务式指挥人工智能解决方案

专知会员服务

13+阅读 · 7月16日

ICML 2026 | 理论级自动形式化：从孤立命题到统一形式化知识库

ICML 2026 | 理论级自动形式化：从孤立命题到统一形式化知识库

专知会员服务

9+阅读 · 7月16日

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

246+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

80+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 长程智能体研究全景：基础、演化、框架、优化与前沿

《边缘端实时无线感知赋能现场多机器人部署》200页

从领域适配到部署与可解释：Berkeley博士论文解析大语言模型真实落地

DARPA拟打造十万规模自主思考作战的AI智能体集群：“受控涌现式分布式人工智能”（DICE）项目

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

专知

20+阅读 · 2018年6月29日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

Understanding Deep Generative Models with Generalized Empirical Likelihoods

Arxiv

0+阅读 · 2023年6月16日

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

Arxiv

0+阅读 · 2023年6月15日

LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models

Arxiv

0+阅读 · 2023年6月15日

Opportunities for Large Language Models and Discourse in Engineering Design

Arxiv

0+阅读 · 2023年6月15日

Reducing Hallucinations in Neural Machine Translation with Feature Attribution

Arxiv

0+阅读 · 2023年6月14日

COVER: A Heuristic Greedy Adversarial Attack on Prompt-based Learning in Language Models

Arxiv

0+阅读 · 2023年6月14日

Towards Reasoning in Large Language Models: A Survey

Arxiv

34+阅读 · 2022年12月20日

An Overview on Machine Translation Evaluation

An Overview on Machine Translation Evaluation

Arxiv

14+阅读 · 2022年2月22日

CoDEx: A Comprehensive Knowledge Graph Completion Benchmark

Arxiv

10+阅读 · 2020年10月6日

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond

Arxiv

15+阅读 · 2020年5月13日

相关基金

微波法合成α-Fe2O3单晶纳米线及其在太阳能光解水电池中的应用研究

国家自然科学基金

0+阅读 · 2015年12月31日

金属化含能材料中金属自钝化及界面反应机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

表面等离子体共振增强的新型高效太阳能电池

国家自然科学基金

0+阅读 · 2013年12月31日

基于能量传递的宽谱调制近红外下转换纳米发光薄膜研究

国家自然科学基金

0+阅读 · 2012年12月31日

表面等离子激元吸收增强型太阳能电池机理的超快光谱研究

国家自然科学基金

0+阅读 · 2012年12月31日

松友饮对肝动脉化疗栓塞治疗后残癌细胞上皮间质转化的影响及其分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

结合硝化反应和超临界流体萃取的高温气冷堆燃料元件中UO2芯球的处理

国家自然科学基金

0+阅读 · 2010年12月31日

大承气汤调控AR42J细胞凋亡-坏死转换分子开关的转化研究

国家自然科学基金

0+阅读 · 2009年12月31日

二甲醚催化水汽重整产氢原位低温还原NOx的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Nrf2-ARE通路在缺血/药物后处理中作用的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员