China National Conference on Computational Linguistics

Multi-reward Based Reinforcement Learning for Neural Machine Translation



Abstract

Reinforcement learning (RL) has made remarkable progress in neural machine translation (NMT). However, it suffers from uneven sampling distributions, sparse rewards, and high variance during training. We therefore propose a multi-reward reinforcement learning training strategy that decouples action selection from value estimation. In addition, our method combines language-model rewards to jointly optimize the model parameters, and adds Gumbel noise during sampling to obtain more informative semantic samples. To verify the robustness of our method, we conduct experiments not only on large corpora but also on low-resource languages. Experimental results show that our approach outperforms the baselines on the WMT14 English-German, LDC2014 Chinese-English, and CWMT2018 Mongolian-Chinese tasks, which fully demonstrates its effectiveness.

