Annual meeting of the Association for Computational Linguistics

Historical Text Normalization with Delayed Rewards

Abstract

Training neural sequence-to-sequence models with simple token-level log-likelihood is now a standard approach to historical text normalization, albeit one that is often outperformed by phrase-based models. Policy gradient training enables direct optimization for exact matches, and while the small datasets in historical text normalization preclude reinforcement learning from scratch, we show that policy gradient fine-tuning leads to significant improvements across the board. Policy gradient training, in particular, leads to more accurate normalizations for long or unseen words.
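The paper fine-tunes a full sequence-to-sequence model, but the core estimator it relies on is plain REINFORCE: sample an output y from the current policy p_θ(y | x), score it with a reward R(y) that is 1 for an exact match with the reference and 0 otherwise, and follow the gradient E[R(y) · ∇_θ log p_θ(y | x)]. The reward arrives only once the full output string has been generated, which is the sense in which it is delayed relative to per-token likelihood. The sketch below is a minimal, self-contained illustration of that estimator using a toy categorical policy over a handful of made-up candidate normalizations; the candidates, the gold form, and the learning rate are illustrative assumptions, not the paper's setup.

```python
# Minimal REINFORCE sketch with an exact-match reward (illustrative only).
# A toy categorical policy over candidate output strings stands in for a
# real sequence-to-sequence decoder; all data here is made up.
import torch

candidates = ["year", "yere", "yeare"]  # hypothetical normalization candidates
gold = "year"                           # reference (gold) normalization

logits = torch.zeros(len(candidates), requires_grad=True)  # policy parameters
optimizer = torch.optim.SGD([logits], lr=0.5)

for step in range(200):
    dist = torch.distributions.Categorical(logits=logits)
    action = dist.sample()                                # sample an output
    reward = 1.0 if candidates[action] == gold else 0.0   # exact match only
    loss = -reward * dist.log_prob(action)                # REINFORCE loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

best = candidates[torch.softmax(logits, dim=0).argmax()]
print(best)  # the policy concentrates on the exact-match candidate "year"
```

With only an exact-match signal, the gradient is nonzero only on steps where the sampled output happens to match the reference, which is why fine-tuning a likelihood-trained model makes sense here: a warm-started policy samples exact matches often enough for the delayed reward to be informative, whereas a randomly initialized one on a small dataset would rarely see any reward at all.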