A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

Abstract

This paper presents a novel latent variable recurrent neural network architecture for jointly modeling sequences of words and (possibly latent) discourse relations between adjacent sentences. A recurrent neural network generates individual words, thus reaping the benefits of discriminatively-trained vector representations. The discourse relations are represented with a latent variable, which can be predicted or marginalized, depending on the task. The resulting model can therefore employ a training objective that includes not only discourse relation classification, but also word prediction. As a result, it outperforms state-of-the-art alternatives for two tasks: implicit discourse relation classification in the Penn Discourse Treebank, and dialog act classification in the Switchboard corpus. Furthermore, by marginalizing over latent discourse relations at test time, we obtain a discourse-informed language model, which improves over a strong LSTM baseline.
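The marginalization the abstract refers to can be sketched abstractly: with a discrete latent discourse relation z ranging over K classes, the probability of the next sentence y given context x decomposes as p(y | x) = Σ_z p(z | x) p(y | z, x), while classification picks the z maximizing p(z | x) p(y | z, x). A minimal NumPy sketch of these two modes (the log-probability inputs here are hypothetical stand-ins for the paper's actual RNN outputs, not its implementation):

```python
import numpy as np

def marginal_log_prob(log_prior, log_likelihoods):
    """Language-model mode: log p(y|x) = logsumexp_z [log p(z|x) + log p(y|z,x)].

    log_prior:       shape (K,), log p(z|x) over K discourse relations.
    log_likelihoods: shape (K,), log p(y|z,x) for each relation z.
    """
    scores = log_prior + log_likelihoods
    m = scores.max()  # subtract max for numerical stability
    return m + np.log(np.exp(scores - m).sum())

def predict_relation(log_prior, log_likelihoods):
    """Classification mode: argmax_z p(z|x) * p(y|z,x)."""
    return int(np.argmax(log_prior + log_likelihoods))
```

With a uniform prior over two relations and equal conditional likelihoods, `marginal_log_prob` simply recovers that shared likelihood, which is a quick sanity check on the log-sum-exp.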

