LSTMs Exploit Linguistic Attributes of Data

机译：LSTM利用数据的语言属性

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

While recurrent neural networks have found success in a variety of natural language processing applications, they are general models of sequential data. We investigate how the properties of natural language data affect an LSTM's ability to learn a nonlinguistic task: recalling elements from its input. We find that models trained on natural language data are able to recall tokens from much longer sequences than models trained on non-language sequential data. Furthermore, we show that the LSTM learns to solve the memorization task by explicitly using a subset of its neurons to count timesteps in the input. We hypothesize that the patterns and structure in natural language data enable LSTMs to learn by providing approximate ways of reducing loss, but understanding the effect of different training data on the learnability of LSTMs remains an open question.

机译：虽然递归神经网络已在各种自然语言处理应用中获得成功，但它们是顺序数据的通用模型。我们研究自然语言数据的属性如何影响LSTM学习非语言任务的能力：从其输入中回忆元素。我们发现，与在非语言顺序数据上训练的模型相比，在自然语言数据上训练的模型能够从更长的序列中调用令牌。此外，我们显示LSTM通过显式使用其神经元的子集来计算输入中的时间步长，从而学会解决记忆任务。我们假设自然语言数据中的模式和结构通过提供减少损失的近似方法使LSTM能够学习，但是了解不同训练数据对LSTM可学习性的影响仍然是一个悬而未决的问题。

著录项

来源
《3rd workshop on representation learning for NLP 2018》|2018年|180-186|共7页
会议地点 Melbourne(AU)
作者
Nelson F. Liu; Omer Levy; Roy Schwartz; Chenhao Tan; Noah A. Smith;
展开▼
作者单位

Paul G. Allen School of Computer Science Engineering, University of Washington, Seattle, WA, USA,Department of Linguistics, University of Washington, Seattle, WA, USA;

Paul G. Allen School of Computer Science Engineering, University of Washington, Seattle, WA, USA;

Paul G. Allen School of Computer Science Engineering, University of Washington, Seattle, WA, USA,Allen Institute for Artificial Intelligence, Seattle, WA, USA;

Department of Computer Science, University of Colorado, Boulder, CO, USA;

Paul G. Allen School of Computer Science Engineering, University of Washington, Seattle, WA, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A Multi-Attribute Decision Making Method Based on Interval Linguistic Standard Deviation Weight-Interval Linguistic Technique for Order Preference by Similarity to Ideal Solution in Interval Linguistic Environment [J] . Rong Zhao, Mingming Hu, Maozhu Jin, Journal of computational and theoretical nanoscience . 2015,第12期

机译：一种基于间隔语言标准偏差权力间隔技术的多属性决策方法，用于间隔语言环境中的理想解决方案的顺序优先
2. Consumer sorting and hedonic valuation of wine attributes: exploiting data from a field experiment [J] . Gustafson Christopher R., Lybbert Travis J., Sumner Daniel A. Agricultural Economics . 2016,第1期

机译：消费者对葡萄酒属性的分类和享乐价值评估：利用现场实验的数据
3. A new approach to mining fuzzy databases using nearest neighbor classification by exploiting attribute hierarchies [J] . De SK, Krishna PR International Journal of Intelligent Systems . 2004,第12期

机译：利用属性层次结构的最近邻分类挖掘模糊数据库的新方法
4. LSTMs Exploit Linguistic Attributes of Data [C] . Nelson F. Liu, Omer Levy, Roy Schwartz, Annual meeting of the Association for Computational Linguistics . 2018

机译：LSTMS利用数据的语言属性
5. RNN/LSTM Data Assimilation for the Lorenz Chaotic Models [D] . Vashistha, Harsh Vardhan. 2018

机译：Lorenz混沌模型的RNN / LSTM数据同化
6. An Approach to Linguistic Multiple Attribute Decision-Making Based on Unbalanced Linguistic Generalized Heronian Mean Aggregation Operator [O] . Bing Han, Huayou Chen, Jiaming Zhu, 2018

机译：基于不平衡语言广义Heronian均值聚合算子的语言多属性决策方法
7. LSTMs Exploit Linguistic Attributes of Data [O] . Nelson F. Liu, Omer Levy, Roy Schwartz, 2018

机译：LSTMS利用数据的语言属性

LSTMs Exploit Linguistic Attributes of Data

摘要

著录项

相似文献

相关主题

期刊订阅