International Conference on Artificial Neural Networks
Learning Sparse Hidden States in Long Short-Term Memory


Abstract

Long Short-Term Memory (LSTM) is a powerful recurrent neural network architecture that is successfully used in many sequence modeling applications. Inside an LSTM unit, a vector called "memory cell" is used to memorize the history. Another important vector, which works along with the memory cell, represents hidden states and is used to make a prediction at a specific step. Memory cells record the entire history, while the hidden states at a specific time step in general need to attend only to very limited information thereof. Therefore, there exists an imbalance between the huge information carried by a memory cell and the small amount of information requested by the hidden states at a specific step. We propose to explicitly impose sparsity on the hidden states to adapt them to the required information.
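The imbalance between the full history stored in the memory cell and the small amount of information a single step needs can be illustrated with a minimal sketch. The snippet below runs one standard LSTM step and then hard-thresholds the hidden state to its top-k largest-magnitude units; this top-k masking is an illustrative assumption, since the abstract does not specify the paper's actual sparsification scheme, and all names here (`lstm_step`, `sparsify_top_k`) are hypothetical:

```python
import numpy as np

def lstm_step(x, h_prev, c_prev, W, U, b):
    """One standard LSTM step; W, U, b pack the four gates (i, f, o, g)."""
    n = h_prev.shape[0]
    z = W @ x + U @ h_prev + b
    i = 1 / (1 + np.exp(-z[:n]))         # input gate
    f = 1 / (1 + np.exp(-z[n:2 * n]))    # forget gate
    o = 1 / (1 + np.exp(-z[2 * n:3 * n]))  # output gate
    g = np.tanh(z[3 * n:])               # candidate cell update
    c = f * c_prev + i * g               # memory cell: accumulates full history
    h = o * np.tanh(c)                   # hidden state: dense by default
    return h, c

def sparsify_top_k(h, k):
    """Keep only the k largest-magnitude hidden units, zero the rest.
    (Illustrative hard thresholding, not necessarily the paper's method.)"""
    mask = np.zeros_like(h)
    mask[np.argsort(np.abs(h))[-k:]] = 1.0
    return h * mask

rng = np.random.default_rng(0)
n, m, k = 8, 4, 2                        # hidden size, input size, sparsity level
W = rng.normal(size=(4 * n, m))
U = rng.normal(size=(4 * n, n))
b = np.zeros(4 * n)

h, c = lstm_step(rng.normal(size=m), np.zeros(n), np.zeros(n), W, U, b)
h_sparse = sparsify_top_k(h, k)          # at most k nonzero hidden units
```

At prediction time the downstream layer then sees only the k retained units, which matches the abstract's point that a specific step needs only a limited slice of the memorized history.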
