On the effects of using word2vec representations in neural networks for dialogue act recognition

Christophe Cerisara; Pavel Kral; Ladislav Lenc

首页> 外文期刊>Computer speech and language >On the effects of using word2vec representations in neural networks for dialogue act recognition

【24h】

On the effects of using word2vec representations in neural networks for dialogue act recognition

机译：关于在神经网络中使用word2vec表示进行对话行为识别的效果

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Dialogue act recognition is an important component of a large number of natural language processing pipelines. Many research works have been carried out in this area, but relatively few investigate deep neural networks and word embeddings. This is surprising, given that both of these techniques have proven exceptionally good in most other language-related domains. We propose in this work a new deep neural network that explores recurrent models to capture word sequences within sentences, and further study the impact of pretrained word embeddings. We validate this model on three languages: English, French and Czech. The performance of the proposed approach is consistent across these languages and it is comparable to the state-of-the-art results in English. More importantly, we confirm that deep neural networks indeed outperform a Maximum Entropy classifier, which was expected. However, and this is more surprising, we also found that standard word2vec embeddings do not seem to bring valuable information for this task and the proposed model, whatever the size of the training corpus is. We thus further analyse the resulting embeddings and conclude that a possible explanation may be related to the mismatch between the type of lexical-semantic information captured by the word2vec embeddings, and the kind of relations between words that is the most useful for the dialogue act recognition task.

机译：对话行为识别是大量自然语言处理管道的重要组成部分。在这一领域已经进行了许多研究工作，但是研究深度神经网络和词嵌入的研究相对较少。鉴于这两种技术在大多数其他与语言相关的领域中都被证明具有出色的性能，这令人惊讶。我们在这项工作中提出了一个新的深度神经网络，该网络将探索递归模型以捕获句子中的单词序列，并进一步研究预训练单词嵌入的影响。我们使用三种语言验证该模型：英语，法语和捷克语。所提出的方法在这些语言之间的性能是一致的，并且可以与英语的最新结果相媲美。更重要的是，我们确认深度神经网络的确优于预期的最大熵分类器。但是，这更令人惊讶，我们还发现，无论训练语料库的大小如何，标准的word2vec嵌入似乎都无法为该任务和建议的模型带来有价值的信息。因此，我们进一步分析了产生的嵌入，并得出结论，可能的解释可能与word2vec嵌入捕获的词汇语义信息的类型与对对话行为识别最有用的单词之间的关系类型之间的不匹配有关。任务。

著录项

来源
《Computer speech and language》 |2018年第1期|175-193|共19页
作者
Christophe Cerisara; Pavel Kral; Ladislav Lenc;
展开▼
作者单位

WRIA-UMR7503, Campus Scientifique, 54506 Vandoeuvre-les-Nancy, France;

Department of Computer Science & Engineering, Faculty of Applied Sciences, University of West Bohemia, Plzen, Czechia,NTIS - New Technologies for the Information Society, Faculty of Applied Sciences, University of West Bohemia, Plzen, Czechia;

Department of Computer Science & Engineering, Faculty of Applied Sciences, University of West Bohemia, Plzen, Czechia,NTIS - New Technologies for the Information Society, Faculty of Applied Sciences, University of West Bohemia, Plzen, Czechia;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Dialogue act; Deep learning; LSTM; Word embeddings; Word2vec;

机译：对话行为;深度学习;LSTM;词嵌入;Word2vec;

相似文献

外文文献
中文文献
专利

1. Combining heterogeneous deep neural networks with conditional random fields for Chinese dialogue act recognition [J] . Zhou Yucan, Hu Qinghua, Liu Jie, Neurocomputing . 2015,第nova30期

机译：结合异构深度神经网络和条件随机场进行中文对话行为识别
2. Joint word2vec Networks for Bilingual Semantic Representations [J] . Lior Wolf, Yair Hanani, Kfir Bar, International journal of computational linguistics and applications . 2014,第1期

机译：双语语义表示的联合word2vec网络
3. Text classification based on word2vec and convolutional neural networks [J] . Basic & clinical pharmacology & toxicology. . 2019,第S10期

机译：基于Word2VEC和卷积神经网络的文本分类
4. Representation, scaling, and time invariance in neural networkspeech recognition: evidence for the recognition of stop consonants andvowels [C] . Will C.A., Bunnell H.T. 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2001. Proceedings, 2001 . 2001

机译：神经网络中的表示，缩放和时间不变性语音识别：停止辅音和元音识别的证据
5. Neural Network Based Representation Learning and Modeling for Speech and Speaker Recognition [D] . Guo, Jinxi. 2019

机译：基于神经网络的语言和扬声器识别的模拟
6. Word2vec convolutional neural networks for classification of news articles and tweets [O] . Beakcheol Jang, Inhwan Kim, Jong Wook Kim 2012

机译：Word2vec卷积神经网络用于新闻文章和推文分类
7. On the effects of using word2vec representations in neural networks for dialogue act recognition [O] . Christophe Cerisara, Pavel Král, Ladislav Lenc 2018

机译：论神经网络中word2vec表示法在对话行为识别中的作用

On the effects of using word2vec representations in neural networks for dialogue act recognition

摘要

著录项

相似文献

相关主题

期刊订阅