IEEE Transactions on Circuits and Systems I: Regular Papers

Retrain-Less Weight Quantization for Multiplier-Less Convolutional Neural Networks


Abstract

This article presents an approximate signed digit representation (ASD) that quantizes the weights of convolutional neural networks (CNNs) in order to build multiplier-less CNNs without any retraining process. Unlike existing methods that require retraining for weight quantization, the proposed method directly converts the full-precision weights of CNN models into low-precision ones, attaining accuracy comparable to that of full-precision models on image classification tasks without retraining. It therefore saves retraining time as well as the associated computational cost. Because the proposed method simplifies each weight to at most two non-zero digits, multiplication can be realized with only add and shift operations, speeding up inference and reducing energy consumption and hardware complexity. Experiments conducted on well-known CNN architectures, such as AlexNet, VGG-16, ResNet-18 and SqueezeNet, show that the proposed method reduces the model size by 73% at the cost of a small increase in error rate, ranging from 0.09% to 1.5% on the ImageNet dataset. Compared to a previous architecture built with multipliers, the proposed multiplier-less convolution architecture reduces the critical-path delay by 52% and cuts hardware complexity and power consumption by more than 50%.
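The key idea in the abstract, restricting each weight to at most two non-zero signed power-of-two digits so that every multiply becomes a shift-and-add, can be illustrated with a minimal sketch. This is not the paper's exact quantization algorithm; the greedy nearest-power-of-two rounding below is an assumption made for illustration:

```python
import math

def quantize_asd(w, n_digits=2):
    """Greedily approximate w with at most n_digits signed power-of-two
    terms, returning a list of (sign, exponent) pairs so that
    sum(sign * 2**exp) approximates w."""
    terms = []
    residual = float(w)
    for _ in range(n_digits):
        if residual == 0:
            break
        sign = 1 if residual > 0 else -1
        # Round to the nearest power-of-two exponent of the magnitude.
        exp = round(math.log2(abs(residual)))
        terms.append((sign, exp))
        residual -= sign * (2.0 ** exp)
    return terms

def asd_value(terms):
    """Reconstruct the quantized weight value from its digit terms."""
    return sum(s * 2.0 ** e for s, e in terms)

def multiply_via_shifts(x, terms):
    """Multiply integer x by the quantized weight using only shifts and
    adds (negative exponents would map to right shifts in fixed-point
    hardware; here we assume non-negative exponents for simplicity)."""
    acc = 0
    for s, e in terms:
        acc += s * (x << e) if e >= 0 else s * (x >> -e)
    return acc

# Example: the weight 6 is exactly 2^3 - 2^1, so two signed digits
# suffice and 5 * 6 reduces to (5 << 3) - (5 << 1).
terms = quantize_asd(6)
print(terms, asd_value(terms), multiply_via_shifts(5, terms))
```

With two signed digits, many weights are captured exactly (as above), and the rest incur only the small quantization error the abstract reports; a real datapath would replace the Python arithmetic with a two-term shifter-adder per weight.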
