首页>
外国专利>
NON-UNIFORM QUANTIZATION OF PRE-TRAINED DEEP NEURAL NETWORK
NON-UNIFORM QUANTIZATION OF PRE-TRAINED DEEP NEURAL NETWORK
展开▼
机译:预处理深层神经网络的非均匀量化
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system and a method of quantizing a pre-trained neural network, includes determining by a layer/channel bit-width determiner for each layer or channel of the pre-trained neural network a minimum quantization noise for the layer or the channel for each master bit-width value in a predetermined set of master bit-width values; and selecting by a bit-width selector for the layer or the channel the master bit-width value having the minimum quantization noise for the layer or the channel. In one embodiment, the minimum quantization noise for the layer or the channel is based on a square of a range of weights for the layer or the channel that is multiplied by a constant to a negative power of a current master bit-width value.
展开▼