Simultaneous Machine Translation with Tailored Reference

Simultaneous machine translation (SiMT) generates translation while reading the whole source sentence. However, existing SiMT models are typically trained using the same reference disregarding the varying amounts of available source information at different latency. Training the model with ground-truth at low latency may introduce forced anticipations, whereas utilizing reference consistent with the source word order at high latency results in performance degradation. Consequently, it is crucial to train the SiMT model with appropriate reference that avoids forced anticipations during training while maintaining high quality. In this paper, we propose a novel method that provides tailored reference for the SiMT models trained at different latency by rephrasing the ground-truth. Specifically, we introduce the tailor, induced by reinforcement learning, to modify ground-truth to the tailored reference. The SiMT model is trained with the tailored reference and jointly optimized with the tailor to enhance performance. Importantly, our method is applicable to a wide range of current SiMT approaches. Experiments on three translation tasks demonstrate that our method achieves state-of-the-art performance in both fixed and adaptive policies.

翻译：同步机器翻译（SiMT）在读取整个源句子的同时生成译文。然而，现有SiMT模型通常使用相同的参考进行训练，忽略了不同延迟下可用源信息的差异。在低延迟下使用真值训练模型可能引入强制预测，而在高延迟下使用与源词序一致的参考则会导致性能下降。因此，用适当的参考训练SiMT模型至关重要，既要避免训练中的强制预测，又要保持高质量。本文提出一种新方法，通过重新表述真值为不同延迟训练的SiMT模型提供定制参考。具体而言，我们引入由强化学习驱动的定制器，将真值修改为定制参考。SiMT模型使用定制参考进行训练，并与定制器联合优化以提升性能。重要的是，该方法适用于当前主流的多种SiMT方法。在三项翻译任务上的实验表明，我们的方法在固定策略和自适应策略下均达到了最优性能。

相关内容

Machine Translation

关注 210

机器翻译（Machine Translation）涵盖计算语言学和语言工程的所有分支，包含多语言方面。特色论文涵盖理论，描述或计算方面的任何下列主题:双语和多语语料库的编写和使用，计算机辅助语言教学，非罗马字符集的计算含义，连接主义翻译方法，对比语言学等。官网地址：http://dblp.uni-trier.de/db/journals/mt/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日