【24h】

Neural Machine Translation of Text from Non-Native Speakers

机译:从非母语人员的文本的神经机翻译

获取原文

摘要

Neural Machine Translation (NMT) systems are known to degrade when confronted with noisy data, especially when the system is trained only on clean data. In this paper, we show that augmenting training data with sentences containing artificially-introduced grammatical errors can make the system more robust to such errors. In combination with an automatic grammar error correction system, we can recover 1.0 BLEU out of 2.4 BLEU lost due to grammatical errors. We also present a set of Spanish translations of the JFLEG grammar error correction corpus, which allows for testing NMT robustness to real grammatical errors.
机译:众所周知,当遇到嘈杂的数据时,已知神经机翻译(NMT)系统降解,特别是当系统仅在清洁数据上培训时。在本文中,我们显示增强培训数据与包含人工引入的语法错误的句子可以使系统对此类错误更加强大。结合自动语法纠错系统,我们可以由于语法错误而恢复1.0 BLEU丢失。我们还提出了一组JFLEG语法错误校正语料库的西班牙语翻译,它允许测试NMT鲁棒性与真实的语法错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号