International Conference on Artificial Neural Networks

Simplified Computation and Interpretation of Fisher Matrices in Incremental Learning with Deep Neural Networks



Abstract

Important recent advances in the domain of incremental or continual learning with DNNs, such as Elastic Weight Consolidation (EWC) or Incremental Moment Matching (IMM), rely on a quantity termed the Fisher information matrix (FIM). While the results obtained in this way are very promising, the use of the FIM rests on the assumptions that (a) the FIM can be approximated by its diagonal, and (b) the FIM's diagonal entries are related to the variance of a DNN parameter in the context of Bayesian neural networks. In addition, the FIM is notoriously difficult to compute in automatic differentiation (AD) frameworks such as TensorFlow, and existing implementations require an excessive amount of memory as a result. We present the Matrix of SQuares (MaSQ), computed similarly to the FIM, but whose use in EWC-like algorithms follows directly from the calculus of derivatives and requires no additional assumptions. Additionally, MaSQ computation in AD frameworks is much simpler and more memory-efficient than FIM computation. Using MaSQ together with EWC, we show performance superior or equal to FIM/EWC on a variety of benchmark tasks.
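As a minimal sketch of the kind of quantity involved (not the authors' code): the diagonal-FIM importances used in EWC-style penalties are commonly accumulated as per-parameter squared gradients, and one plausible reading of a "matrix of squares" is exactly these squares of loss gradients, which are cheap to obtain in an AD framework. The sketch below assumes TensorFlow 2; the names `model`, `loss_fn`, and `dataset` are placeholders, not from the paper.

```python
import tensorflow as tf

def squared_gradient_importances(model, loss_fn, dataset):
    """Accumulate the mean squared gradient of the loss w.r.t. each
    trainable variable over `dataset` (one tensor per variable).
    This is the squared-gradient importance used as a diagonal-FIM
    surrogate in EWC-style penalties; assumes every variable
    receives a gradient on every batch."""
    sums = [tf.zeros_like(v) for v in model.trainable_variables]
    n_batches = 0
    for x, y in dataset:
        with tf.GradientTape() as tape:
            loss = loss_fn(y, model(x, training=False))
        grads = tape.gradient(loss, model.trainable_variables)
        sums = [s + tf.square(g) for s, g in zip(sums, grads)]
        n_batches += 1
    return [s / n_batches for s in sums]

def ewc_penalty(model, anchor_weights, importances, lam=1.0):
    """Quadratic EWC-style penalty: (lam/2) * sum_i F_i (theta_i - theta*_i)^2,
    where theta* are the weights saved after the previous task and F_i
    the per-parameter importances computed above."""
    return 0.5 * lam * tf.add_n([
        tf.reduce_sum(f * tf.square(v - w))
        for v, w, f in zip(model.trainable_variables,
                           anchor_weights, importances)
    ])
```

Because the squared gradients are formed one batch at a time and reduced in place, this avoids materializing any per-example gradient storage, which is in line with the memory argument made in the abstract.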
