Syntax has been proven to be remarkably effective in neural machine translation (NMT). Previous models obtained syntax information from syntactic parsing tools and integrated it into NMT models to improve translation performance. In this work, we propose a method to incorporate syntax information into a complex-valued Encoder-Decoder architecture. The proposed model jointly learns word-level and syntax-level attention scores from the source side to the target side using an attention mechanism. Importantly, it is not dependent on specific network architectures and can be directly integrated into any existing sequence-to-sequence (Seq2Seq) framework. The experimental results demonstrate that the proposed method can bring significant improvements in BLEU scores on two datasets. In particular, the proposed method achieves a greater improvement in BLEU scores in translation tasks involving language pairs with significant syntactic differences.
翻译:句法在神经机器翻译(NMT)中已被证明具有显著效果。先前的研究通过句法分析工具获取句法信息,并将其整合到NMT模型中以提高翻译性能。在本工作中,我们提出了一种将句法信息融入复数值编码器-解码器架构的方法。该模型通过注意力机制,联合学习从源端到目标端的词级和句法级注意力分数。重要的是,该方法不依赖于特定的网络架构,可直接集成到任何现有的序列到序列(Seq2Seq)框架中。实验结果表明,所提方法能在两个数据集上显著提升BLEU分数。特别地,在涉及句法差异较大的语言对的翻译任务中,所提方法在BLEU分数上取得了更大的提升。