International Conference on Artificial Neural Networks

Simplified Computation and Interpretation of Fisher Matrices in Incremental Learning with Deep Neural Networks

Abstract

Important recent advances in the domain of incremental or continual learning with DNNs, such as Elastic Weight Consolidation (EWC) or Incremental Moment Matching (IMM), rely on a quantity termed the Fisher information matrix (FIM). While the results obtained in this way are very promising, the use of the FIM relies on the assumptions that (a) the FIM can be approximated by its diagonal, and (b) the FIM diagonal entries are related to the variance of a DNN parameter in the context of Bayesian neural networks. In addition, the FIM is notoriously difficult to compute in automatic differentiation (AD) frameworks such as TensorFlow, and existing implementations require an excessive amount of memory as a result. We present the Matrix of SQuares (MaSQ), computed similarly to the FIM, but whose use in EWC-like algorithms follows directly from the calculus of derivatives and requires no additional assumptions. Additionally, MaSQ computation in AD frameworks is much simpler and more memory-efficient than FIM computation. When using MaSQ together with EWC, we show performance superior or equal to FIM/EWC on a variety of benchmark tasks.
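
To make the contrast concrete: the diagonal FIM entry for a parameter θ_i is F_ii = E_x E_{y~p_θ(y|x)} [(∂ log p_θ(y|x) / ∂θ_i)²], which requires label samples drawn from the model's own predictive distribution, whereas a matrix of squared gradients can be accumulated directly from ordinary training-loss gradients. Below is a minimal TensorFlow sketch of such an accumulation, assuming a Keras model and a labelled tf.data.Dataset; the function name, the per-batch averaging, and the use of the plain training loss are illustrative assumptions, not the authors' implementation.

import tensorflow as tf

# A minimal sketch, not the paper's code: accumulate per-parameter
# mean squared gradients (a diagonal "matrix of squares") over a dataset.
def squared_gradient_diagonal(model, dataset, loss_fn):
    # One accumulator per trainable variable, initialised to zero.
    acc = [tf.zeros_like(v) for v in model.trainable_variables]
    n_batches = 0
    for x, y in dataset:
        with tf.GradientTape() as tape:
            # Ordinary training-loss gradients w.r.t. the true labels:
            # no sampling from the model's predictive distribution needed.
            loss = loss_fn(y, model(x, training=False))
        grads = tape.gradient(loss, model.trainable_variables)
        acc = [a + tf.square(g) for a, g in zip(acc, grads)]
        n_batches += 1
    # Mean of squared per-batch gradients for each parameter.
    return [a / n_batches for a in acc]

The resulting per-parameter quantities could then take the place of the diagonal FIM in an EWC-style quadratic penalty, (λ/2) Σ_i M_ii (θ_i − θ*_i)². Because the sketch reuses the gradients a standard training step already produces, it needs no extra graph construction or per-example Jacobians, which is consistent with the simplicity and memory savings the abstract reports.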
