We introduce a data-dependent weight initialization scheme for the ReLU and output layers commonly found in modern neural network architectures. An initial feedforward pass through the network is performed using an initialization set (a subset of the training data). Using statistics obtained from this pass, we initialize the weights of the network so that the following properties are satisfied: (1) weight matrices are orthogonal; (2) ReLU layers produce a predetermined fraction of nonzero activations; (3) the outputs produced by internal layers have a predetermined variance; (4) the weights in the last layer minimize the squared error on the initialization set. We evaluate our method on popular architectures (VGG16, VGG19, and InceptionV3) and show that it achieves faster convergence on the ImageNet data set than state-of-the-art initialization techniques (LSUV, He, and Glorot).
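As a rough illustration of the pipeline described above, the sketch below applies the four properties to a small fully connected network in NumPy. The hyperparameters `keep_frac` and `target_var`, the quantile-based bias shift, and all function names are our own assumptions for illustration; the paper's actual per-layer rules may differ.

```python
import numpy as np


def orthogonal(n_out, n_in, rng):
    """Random orthogonal weight matrix of shape (n_out, n_in) via QR."""
    a = rng.standard_normal((max(n_out, n_in), min(n_out, n_in)))
    q, _ = np.linalg.qr(a)  # q has orthonormal columns
    return q.T if n_out < n_in else q


def init_relu_layer(W, b, X, keep_frac, target_var):
    """Tune one ReLU layer on the init batch X (shape: samples x n_in).

    Rescales each row of W so pre-activations have variance target_var
    (rows stay mutually orthogonal after rescaling), then sets each bias
    so a keep_frac fraction of that unit's activations is nonzero.
    Returns the layer's activations on the initialization set.
    """
    z = X @ W.T                                          # pre-activations
    W *= np.sqrt(target_var) / (z.std(axis=0)[:, None] + 1e-8)
    z = X @ W.T
    # Shift the (1 - keep_frac) quantile of each unit to zero, so exactly
    # keep_frac of its activations survive the ReLU on the init set.
    b[:] = -np.quantile(z, 1.0 - keep_frac, axis=0)
    return np.maximum(z + b, 0.0)


def init_output_layer(H, Y):
    """Least-squares output weights: argmin_W ||H @ W.T - Y||^2."""
    W, *_ = np.linalg.lstsq(H, Y, rcond=None)
    return W.T


rng = np.random.default_rng(0)
X = rng.standard_normal((256, 64))   # initialization set (subset of training data)
Y = rng.standard_normal((256, 10))   # targets on the initialization set
H, params = X, []
for n_in, n_out in [(64, 128), (128, 128)]:
    W, b = orthogonal(n_out, n_in, rng), np.zeros(n_out)
    H = init_relu_layer(W, b, H, keep_frac=0.5, target_var=1.0)
    params.append((W, b))
W_out = init_output_layer(H, Y)      # property (4): least-squares last layer
```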