Published in: China National Conference on Computational Linguistics

A Mixed Learning Objective for Neural Machine Translation



Abstract

Evaluation discrepancy and the overcorrection phenomenon are two common problems in neural machine translation (NMT). NMT models are generally trained with a word-level learning objective but evaluated by sentence-level metrics. Moreover, the cross-entropy loss discourages the model from generating synonymous predictions and overcorrects them toward the ground-truth words. To address these two drawbacks, we adopt multi-task learning and propose a mixed learning objective (MLO) that combines the strengths of word-level and sentence-level evaluation without modifying the model structure. At the word level, it calculates the semantic similarity between predicted and ground-truth words. At the sentence level, it computes probabilistic n-gram matching scores for generated translations. We also combine a loss-sensitive scheduled sampling decoding strategy with MLO to explore its extensibility. Experimental results on the IWSLT 2016 German-English and WMT 2019 English-Chinese datasets demonstrate that our methodology significantly improves translation quality. An ablation study shows that both the word-level and sentence-level learning objectives improve BLEU scores. Furthermore, MLO is compatible with state-of-the-art scheduled sampling methods and can achieve further gains.
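To make the two components of the objective concrete, the sketch below combines a word-level semantic-similarity loss (distance between the expected embedding under the model's softmax and the gold word's embedding) with a sentence-level n-gram matching loss, mixed by a weight `alpha`. This is a simplified toy reconstruction, not the authors' implementation: the embedding table, the hard clipped n-gram precision (standing in for the paper's probabilistic matching), and the value of `alpha` are all illustrative assumptions.

```python
import math
from collections import Counter

def cosine(u, v):
    # Cosine similarity between two dense vectors.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def word_level_loss(step_probs, gold_ids, emb):
    # For each decoding step: 1 - cosine similarity between the expected
    # embedding under the model distribution and the gold word embedding.
    # Synonyms with nearby embeddings are penalized less than with cross-entropy.
    dim = len(emb[0])
    total = 0.0
    for probs, gold in zip(step_probs, gold_ids):
        expected = [sum(p * emb[w][d] for w, p in enumerate(probs))
                    for d in range(dim)]
        total += 1.0 - cosine(expected, emb[gold])
    return total / len(gold_ids)

def ngram_counts(ids, n):
    return Counter(tuple(ids[i:i + n]) for i in range(len(ids) - n + 1))

def sentence_level_loss(pred_ids, ref_ids, max_n=2):
    # 1 - geometric mean of clipped n-gram precisions (a BLEU-like score
    # over hard token ids; the paper uses a probabilistic variant).
    precisions = []
    for n in range(1, max_n + 1):
        pc, rc = ngram_counts(pred_ids, n), ngram_counts(ref_ids, n)
        overlap = sum(min(c, rc[g]) for g, c in pc.items())
        total = max(sum(pc.values()), 1)
        precisions.append(max(overlap / total, 1e-9))
    score = math.exp(sum(math.log(p) for p in precisions) / max_n)
    return 1.0 - score

def mixed_learning_objective(step_probs, gold_ids, pred_ids, emb, alpha=0.5):
    # Linear mix of the word-level and sentence-level terms.
    return (alpha * word_level_loss(step_probs, gold_ids, emb)
            + (1 - alpha) * sentence_level_loss(pred_ids, gold_ids))
```

On a toy vocabulary with one-hot embeddings, a model that puts all its probability mass on the gold words and reproduces the reference word order drives both terms, and hence the mixed loss, to zero; any distributional or ordering error raises it.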
