International Conference on Artificial Neural Networks

Dynamically Sacrificing Accuracy for Reduced Computation: Cascaded Inference Based on Softmax Confidence



Abstract

We study the tradeoff between computational effort and classification accuracy in a cascade of deep neural networks. During inference, the user sets the acceptable accuracy degradation which then automatically determines confidence thresholds for the intermediate classifiers. As soon as the confidence threshold is met, inference terminates immediately without having to compute the output of the complete network. Confidence levels are derived directly from the softmax outputs of intermediate classifiers, as we do not train special decision functions. We show that using a softmax output as a confidence measure in a cascade of deep neural networks leads to a reduction of 15%-50% in the number of MAC operations while degrading the classification accuracy by roughly 1%. Our method can be easily incorporated into pre-trained non-cascaded architectures, as we exemplify on ResNet. Our main contribution is a method that dynamically adjusts the tradeoff between accuracy and computation without retraining the model.
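
Below is a minimal sketch of the early-exit rule the abstract describes: each intermediate classifier's softmax output is used as a confidence score, and inference stops at the first stage whose confidence meets its threshold. It assumes a PyTorch-style cascade in which each stage has its own classifier head; the `stages`, `classifiers`, and `thresholds` names are illustrative placeholders, and the paper's procedure for turning a user-specified accuracy-degradation budget into per-stage thresholds is not reproduced here.

```python
import torch
import torch.nn.functional as F

def cascaded_inference(x, stages, classifiers, thresholds):
    """Early-exit inference over a cascade of network stages.

    stages[i]      : module mapping features -> features
    classifiers[i] : module mapping features -> class logits
    thresholds[i]  : softmax-confidence threshold for stage i

    Assumes a batch of size 1 for simplicity.
    """
    features = x
    for stage, classifier, tau in zip(stages, classifiers, thresholds):
        features = stage(features)
        logits = classifier(features)
        probs = F.softmax(logits, dim=-1)
        confidence, prediction = probs.max(dim=-1)
        # Exit as soon as the softmax confidence meets the threshold,
        # skipping the remaining (more expensive) stages.
        if confidence.item() >= tau:
            return prediction.item(), confidence.item()
    # No intermediate exit fired: fall through to the final classifier.
    return prediction.item(), confidence.item()
```

Per the abstract, the thresholds would be chosen (e.g., on held-out data) so that the expected accuracy drop stays within the degradation the user is willing to accept, which is what lets the accuracy/computation tradeoff be adjusted at inference time without retraining.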
