Neural Machine Translation of Text from Non-Native Speakers

机译：从非母语人员的文本的神经机翻译

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Neural Machine Translation (NMT) systems are known to degrade when confronted with noisy data, especially when the system is trained only on clean data. In this paper, we show that augmenting training data with sentences containing artificially-introduced grammatical errors can make the system more robust to such errors. In combination with an automatic grammar error correction system, we can recover 1.0 BLEU out of 2.4 BLEU lost due to grammatical errors. We also present a set of Spanish translations of the JFLEG grammar error correction corpus, which allows for testing NMT robustness to real grammatical errors.

机译：众所周知，当遇到嘈杂的数据时，已知神经机翻译（NMT）系统降解，特别是当系统仅在清洁数据上培训时。在本文中，我们显示增强培训数据与包含人工引入的语法错误的句子可以使系统对此类错误更加强大。结合自动语法纠错系统，我们可以由于语法错误而恢复1.0 BLEU丢失。我们还提出了一组JFLEG语法错误校正语料库的西班牙语翻译，它允许测试NMT鲁棒性与真实的语法错误。

著录项

来源
《Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies》|2019年|xciii p. 2799-3497|共11页
会议地点
作者
Antonios Anastasopoulos; Alison Lui; Toan Q. Nguyen; David Chiang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Neural-based Machine Translation for Medical Text Domain. Based on European Medicines Agency Leaflet Texts [J] . Krzysztof Wo?k, Krzysztof Marasek Procedia Computer Science . 2015,第1期

机译：用于医学文本域的基于神经的机器翻译。根据欧洲药品管理局传单文本
2. Impact of Filtering Generated Pseudo Bilingual Texts in Low-Resource Neural Machine Translation Enhancement: The Case of Persian-Spanish [J] . Benyamin Ahmadnia, Bonnie J. Dorr, Raul Aranovich Procedia Computer Science . 2021,第a期

机译：滤波产生的伪双语文本在低资源神经机翻译增强中的影响：波斯语西班牙语的情况
3. Cross-lingual text similarity exploiting neural machine translation models [J] . Kazuhiro Seki Journal of Information Science . 2021,第3期

机译：跨语言文本相似性利用神经机翻译模型
4. Neural Machine Translation of Text from Non-Native Speakers [C] . Antonios Anastasopoulos, Alison Lui, Toan Q. Nguyen, Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2019

机译：来自非母语者的文本的神经机器翻译
5. Cohesion in translation: A corpus study of human-translated, machine-translated, and non-translated texts (Russian into English). [D] . Bystrova-McIntyre, Tatyana. 2012

机译：翻译中的衔接：对人工翻译，机器翻译和非翻译文本（俄语译成英语）的语料库研究。
6. Neural Machine Translation–Based Automated Current Procedural Terminology Classification System Using Procedure Text: Development and Validation Study [O] . Hyeon Joo, Michael Burns, Sai Saradha Kalidaikurichi Lakshmanan, 2021

机译：基于神经电机的自动化当前程序术语分类系统使用过程文本：开发和验证研究
7. Emotion Detection in Non-native English Speakers’ Text-Only Messages by Native and Non-native Speakers [O] . Hautasaari, Ari, Yamashita, Naomi 2015

机译：母语和非母语的英语母语者的纯文本消息中的情绪检测

Neural Machine Translation of Text from Non-Native Speakers

摘要

著录项

相似文献

相关主题

期刊订阅