This paper presents the submission of Huawei Translation Services Center (HW-TSC) to the machine translation tasks of the 20th China Conference on Machine Translation (CCMT 2024). We participate in the bilingual machine translation task and the multi-domain machine translation task. For these two translation tasks, we use training strategies such as regularized dropout, bidirectional training, data diversification, forward translation, back translation, alternated training, curriculum learning, and transductive ensemble learning to train neural machine translation (NMT) models based on the deep Transformer-big architecture. Furthermore, to explore whether a large language model (LLM) can help improve the translation quality of NMT systems, we use supervised fine-tuning to train llama2-13b as an automatic post-editing (APE) model to refine the translation results of the NMT model on the multi-domain machine translation task. By using these enhancement strategies, our submission achieves a competitive result in the final evaluation.
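Of the training strategies named above, regularized dropout (R-Drop) admits a compact formulation. The equation below is a sketch of the standard R-Drop objective from the literature, not one taken from this submission; the weight $\alpha$ and the two dropout-perturbed output distributions $P_1$ and $P_2$ are the conventional notation, assumed here for illustration. Each training example is passed through the model twice with independent dropout masks, and the symmetric KL divergence between the two resulting distributions is added to the usual negative log-likelihood terms:
\[
\mathcal{L} = -\log P_1(y \mid x) - \log P_2(y \mid x)
+ \frac{\alpha}{2}\Bigl[ D_{\mathrm{KL}}\bigl(P_1 \,\|\, P_2\bigr) + D_{\mathrm{KL}}\bigl(P_2 \,\|\, P_1\bigr) \Bigr].
\]
The KL term penalizes inconsistency between the two stochastic forward passes, which regularizes the dropout-induced variance of the model's predictions.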