
Recurrent Memory Networks for Language Modeling


Abstract

Recurrent Neural Networks (RNNs) have obtained excellent results in many natural language processing (NLP) tasks. However, understanding and interpreting the source of this success remains a challenge. In this paper, we propose the Recurrent Memory Network (RMN), a novel RNN architecture that not only amplifies the power of RNNs but also facilitates our understanding of their internal functioning and allows us to discover underlying patterns in data. We demonstrate the power of RMN on language modeling and sentence completion tasks. On language modeling, RMN outperforms Long Short-Term Memory (LSTM) networks on three large German, Italian, and English datasets. Additionally, we perform an in-depth analysis of the various linguistic dimensions that RMN captures. On the Sentence Completion Challenge, for which it is essential to capture sentence coherence, our RMN obtains 69.2% accuracy, surpassing the previous state of the art by a large margin.
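To make the high-level idea concrete, below is a minimal sketch of an LSTM language model augmented with an attention-based memory over the n most recent input words, in the spirit of the architecture the abstract describes. This is an illustrative assumption based only on the abstract, not the paper's exact formulation; the class name MemoryBlockLM, the hyperparameters, and the dot-product attention are all hypothetical choices.

```python
# Sketch only: an LSTM LM with a memory block attending over the last n
# input embeddings. Assumed design, not the paper's exact architecture.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MemoryBlockLM(nn.Module):
    def __init__(self, vocab_size, dim=128, mem_size=15):
        super().__init__()
        self.mem_size = mem_size                      # attend over last n words
        self.embed = nn.Embedding(vocab_size, dim)
        self.lstm = nn.LSTM(dim, dim, batch_first=True)
        self.out = nn.Linear(2 * dim, vocab_size)    # hidden state + memory context

    def forward(self, tokens):                        # tokens: (batch, seq_len)
        emb = self.embed(tokens)                      # (batch, seq, dim)
        h, _ = self.lstm(emb)                         # (batch, seq, dim)
        logits = []
        for t in range(tokens.size(1)):
            lo = max(0, t - self.mem_size + 1)
            mem = emb[:, lo:t + 1]                    # up to n recent embeddings
            # dot-product attention between current hidden state and memory
            scores = torch.bmm(mem, h[:, t].unsqueeze(2)).squeeze(2)
            attn = F.softmax(scores, dim=1)           # weights over memory slots
            ctx = torch.bmm(attn.unsqueeze(1), mem).squeeze(1)
            logits.append(self.out(torch.cat([h[:, t], ctx], dim=1)))
        return torch.stack(logits, dim=1)             # (batch, seq, vocab)

# Usage: next-word logits for a random batch of token ids.
model = MemoryBlockLM(vocab_size=10000)
x = torch.randint(0, 10000, (2, 20))
print(model(x).shape)                                 # torch.Size([2, 20, 10000])
```

One appealing property of such a memory block, consistent with the abstract's interpretability claim, is that the attention weights over recent words can be inspected directly to see which positions the model relies on.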
