...
首页> 外文期刊>Computational linguistics >Incorporating Source-Side Phrase Structures into Neural Machine Translation
【24h】

Incorporating Source-Side Phrase Structures into Neural Machine Translation

机译:将源语言短语结构纳入神经机器翻译

获取原文
           

摘要

Neural machine translation (NMT) has shown great success as a new alternative to the traditional Statistical Machine Translation model in multiple languages. Early NMT models are based on sequence-to-sequence learning that encodes a sequence of source words into a vector space and generates another sequence of target words from the vector. In those NMT models, sentences are simply treated as sequences of words without any internal structure. In this article, we focus on the role of the syntactic structure of source sentences and propose a novel end-to-end syntactic NMT model, which we call a tree-to-sequence NMT model, extending a sequence-to-sequence model with the source-side phrase structure. Our proposed model has an attention mechanism that enables the decoder to generate a translated word while softly aligning it with phrases as well as words of the source sentence. We have empirically compared the proposed model with sequence-to-sequence models in various settings on Chinese-to-Japanese and English-to-Japanese translation tasks. Our experimental results suggest that the use of syntactic structure can be beneficial when the training data set is small, but is not as effective as using a bi-directional encoder. As the size of training data set increases, the benefits of using a syntactic tree tends to diminish.
机译:神经机器翻译(NMT)作为多语言传统统计机器翻译模型的新替代品已显示出巨大的成功。早期的NMT模型基于序列到序列的学习,该学习将源词序列编码到向量空间中,并从向量中生成目标词的另一个序列。在那些NMT模型中,句子被简单地视为单词序列,没有任何内部结构。在本文中,我们着眼于源句的句法结构的作用,并提出了一种新颖的端到端句法NMT模型,我们将其称为树到序列NMT模型,并扩展了序列到序列模型。源端短语结构。我们提出的模型具有一种关注机制,该机制可使解码器生成翻译的单词,同时使其与短语以及源句子的单词进行软对齐。我们已在汉日和英日翻译任务的各种设置中对提议的模型与序列到序列模型进行了经验比较。我们的实验结果表明,当训练数据集较小时,使用语法结构可能会有所帮助,但不如使用双向编码器有效。随着训练数据集的大小增加,使用语法树的好处趋于减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号