Context Consistency between Training and Testing in Simultaneous Machine Translation

Simultaneous Machine Translation (SiMT) aims to yield a real-time partial translation from a monotonically growing source-side context. However, there is a counterintuitive phenomenon in how context is used between training and testing: for example, a model tested with wait-k and trained consistently with wait-k yields much worse translation quality than a model trained inconsistently with wait-k' (k' ≠ k). We first investigate the reasons behind this phenomenon and uncover two factors: 1) the limited correlation between translation quality and the training (cross-entropy) loss; 2) exposure bias between training and testing. Based on these two factors, we propose an effective training approach called context consistency training, which makes context usage consistent between training and testing by optimizing translation quality and latency as bi-objectives and by exposing the model to its own predictions during training. Experiments on three language pairs support our intuition: with the help of context consistency training, our system that encourages context consistency outperforms, for the first time, existing systems that train with inconsistent context.
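
To make the train/test context gap concrete, here is a minimal sketch. It assumes the usual wait-k read/write schedule, in which the t-th target token is predicted after reading min(k + t - 1, |x|) source tokens; the abstract itself does not spell this out. The function name `wait_k_context` and the sentence lengths are hypothetical and chosen only for illustration.

```python
from typing import List


def wait_k_context(k: int, src_len: int, tgt_len: int) -> List[int]:
    """Source tokens visible at each target step t = 1..tgt_len under wait-k.

    Assumes the standard schedule: read k tokens first, then alternate
    one write and one read until the source is exhausted.
    """
    return [min(k + t - 1, src_len) for t in range(1, tgt_len + 1)]


# Hypothetical setting: train with wait-5, test with wait-3.
train_ctx = wait_k_context(k=5, src_len=8, tgt_len=6)
test_ctx = wait_k_context(k=3, src_len=8, tgt_len=6)

# Per-step difference in visible source context between training and testing.
gap = [a - b for a, b in zip(train_ctx, test_ctx)]
print(train_ctx, test_ctx, gap)
```

With k' ≠ k the two lists differ at the early target steps, which is the context inconsistency the abstract refers to; with k' = k the gap is zero at every step.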

