IEEE Transactions on Neural Networks and Learning Systems

Bayesian Recurrent Neural Network for Language Modeling



Abstract

A language model (LM) computes the probability of a word sequence and thereby provides the word-prediction component for a variety of information systems. A recurrent neural network (RNN) is well suited to learning the large-span dynamics of a word sequence in a continuous space. However, training an RNN-LM is an ill-posed problem because the large vocabulary and the high-dimensional hidden layer yield too many parameters. This paper presents a Bayesian approach to regularizing the RNN-LM and applies it to continuous speech recognition. We aim to penalize an overly complex RNN-LM by compensating for the uncertainty of the estimated model parameters, which is represented by a Gaussian prior. The objective function of the Bayesian classification network is formed as a regularized cross-entropy error function. The regularized model is constructed not only by computing the regularized parameters according to the maximum a posteriori criterion but also by estimating the Gaussian hyperparameter through maximization of the marginal likelihood. A rapid approximation to the Hessian matrix, obtained by selecting a small set of salient outer products, is developed to implement the Bayesian RNN-LM (BRNN-LM). The proposed BRNN-LM achieves a sparser model than the RNN-LM. Experiments on different corpora show robust system performance when the rapid BRNN-LM is applied under different conditions.
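The two quantities the abstract names can be stated concretely. Below is a minimal sketch (not the authors' code; all function names are illustrative) of the regularized cross-entropy objective, where a zero-mean Gaussian prior with precision `alpha` over the weights contributes an L2 penalty, and of the outer-product approximation that keeps the Hessian tractable by summing per-step gradient outer products.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a vector of logits."""
    e = np.exp(z - z.max())
    return e / e.sum()

def regularized_cross_entropy(logits, target_ids, weights, alpha):
    """Cross-entropy over a word sequence plus the Gaussian-prior penalty.

    logits     -- (T, V) unnormalized next-word scores at each of T steps
    target_ids -- length-T list of correct next-word indices
    weights    -- flat parameter vector of the model
    alpha      -- prior precision; in the paper this hyperparameter is
                  estimated by maximizing the marginal likelihood, here
                  it is simply given.
    """
    ce = 0.0
    for t, y in enumerate(target_ids):
        ce -= np.log(softmax(logits[t])[y])
    # Gaussian prior N(0, alpha^-1 I) on weights -> L2 penalty term
    return ce + 0.5 * alpha * np.dot(weights, weights)

def outer_product_hessian(grads):
    """Outer-product (Gauss-Newton-style) Hessian approximation:
    H ~ sum_t g_t g_t^T over per-step gradients g_t of shape (D,).
    The paper additionally selects only a small, salient subset of
    these outer products to make the approximation rapid."""
    return sum(np.outer(g, g) for g in grads)
```

For example, raising `alpha` from 0 to 1 increases the loss by exactly `0.5 * ||w||^2`, which is the penalty the Gaussian prior contributes.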
机译:语言模型(LM)被计算为单词序列的概率,它为各种信息系统的单词预测提供了解决方案。递归神经网络(RNN)强大,可以学习连续空间中单词序列的大跨度动态。但是,由于来自大字典大小和高维隐藏层的参数太多,因此RNN-LM的训练是一个不适的问题。本文提出了一种贝叶斯方法来规范化RNN-LM并将其应用于连续语音识别。我们旨在通过补偿估计的模型参数的不确定性来惩罚过于复杂的RNN-LM,该不确定性由高斯先验表示。贝叶斯分类网络中的目标函数形成为正则化的交叉熵误差函数。不仅通过根据最大准则计算正则化参数来构造正则化模型,而且还通过最大化边际可能性来估计高斯超参数来构造正则化模型。通过选择少量显着的外部乘积,开发出了对Hessian矩阵的快速近似以实现贝叶斯RNN-LM(BRNN-LM)。提出的BRNN-LM比RNN-LM实现了稀疏模型。在不同语料库上的实验通过在不同条件下应用快速BRNN-LM显示了系统性能的鲁棒性。

