International Conference on Artificial Neural Networks

A Study on Catastrophic Forgetting in Deep LSTM Networks



Abstract

We present a systematic study of Catastrophic Forgetting (CF), i.e., the abrupt loss of previously acquired knowledge, when retraining deep recurrent LSTM networks with new samples. CF has recently received renewed attention in the case of feed-forward DNNs, and this article is the first work that aims to rigorously establish whether deep LSTM networks are afflicted by CF as well, and to what degree. In order to test this fully, training is conducted using a wide variety of high-dimensional image-based sequence classification tasks derived from established visual classification benchmarks (MNIST, Devanagari, FashionMNIST and EMNIST). We find that the CF effect occurs universally, without exception, for deep LSTM-based sequence classifiers, regardless of the construction and provenance of sequences. This leads us to conclude that LSTMs, just like DNNs, are fully affected by CF, and that further research work needs to be conducted in order to determine how to avoid this effect (which is not a goal of this study).
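The sequential-retraining protocol the abstract describes (train on one task, then retrain on a new one without replaying old samples, and measure the accuracy drop on the first task) can be sketched as follows. This is a minimal, hypothetical illustration: it uses a plain logistic-regression classifier on synthetic 2-D data in place of a deep LSTM on image-based sequences, and two deliberately conflicting tasks so that forgetting is guaranteed to appear; it is not the paper's actual experimental setup.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_task(n, flip):
    # Synthetic binary task: the label is the sign of the first feature.
    # flip=True inverts the rule, so the two tasks directly conflict.
    X = rng.normal(size=(n, 2))
    y = (X[:, 0] > 0).astype(int)
    if flip:
        y = 1 - y
    return X, y

def train(w, b, X, y, lr=0.5, epochs=200):
    # Plain logistic regression fitted by full-batch gradient descent.
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        g = p - y
        w -= lr * (X.T @ g) / len(y)
        b -= lr * g.mean()
    return w, b

def accuracy(w, b, X, y):
    p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
    return float(((p > 0.5).astype(int) == y).mean())

Xa, ya = make_task(500, flip=False)   # task A
Xb, yb = make_task(500, flip=True)    # task B, conflicting with A

w, b = np.zeros(2), 0.0
w, b = train(w, b, Xa, ya)
acc_before = accuracy(w, b, Xa, ya)   # accuracy on A right after training on A

w, b = train(w, b, Xb, yb)            # retrain on B only, no samples from A
acc_after = accuracy(w, b, Xa, ya)    # accuracy on A collapses: forgetting

print(f"task A accuracy before: {acc_before:.2f}, after retraining on B: {acc_after:.2f}")
```

The gap `acc_before - acc_after` is the forgetting measure; the paper's finding is that this gap remains large for deep LSTM sequence classifiers across all tested benchmarks.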
