IEEE Transactions on Parallel and Distributed Systems

SmartTuning: Selecting Hyper-Parameters of a ConvNet System for Fast Training and Small Working Memory

Abstract

It is desirable to deploy a ConvNet system with high inference accuracy, fast training, and small inference memory. However, existing approaches to hyper-parameter tuning focus only on high accuracy. Even when it achieves high accuracy, poor tuning can significantly increase the performance burden and thus degrade the overall performance of a ConvNet system. In this article, we propose SmartTuning, an approach that identifies the hyper-parameters of a ConvNet system for high training speed and small working memory, under the restriction of high inference accuracy. The key idea of SmartTuning is to build a new performance model for a ConvNet system and to integrate Bayesian Optimization to learn the relationship between the overall performance and the hyper-parameters of a ConvNet system. In this way, SmartTuning can balance inference accuracy, training speed, and inference memory usage during the tuning process, and thus maximize the overall performance of a ConvNet system. Our experiments show that, compared with existing tuning approaches, SmartTuning stably identifies hyper-parameter sets that offer very close accuracy with faster training (7x-11x over MNIST and 2x-3x over CIFAR-10) and much lower inference memory usage (17x-23x over MNIST and 4x-9x over CIFAR-10).
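The abstract describes the approach only at a high level: a performance model that scores accuracy, training speed, and inference memory together, searched with Bayesian Optimization. The sketch below illustrates that general style of tuning loop; it is not the paper's implementation. The search space, the objective weights, and the `train_and_measure` helper are hypothetical placeholders, and a Gaussian-process surrogate with an expected-improvement acquisition stands in for whatever surrogate model SmartTuning actually uses.

```python
# Minimal sketch (not the authors' code) of Bayesian optimization over
# ConvNet hyper-parameters with a composite objective that rewards accuracy
# while penalizing training time and inference memory. The search space,
# weights, and train_and_measure below are hypothetical illustrations.

import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

rng = np.random.default_rng(0)

# Hypothetical search space: (log2 batch size, log10 learning rate, conv width).
BOUNDS = np.array([[4.0, 8.0], [-4.0, -1.0], [16.0, 128.0]])

def train_and_measure(x):
    """Placeholder: train a ConvNet with hyper-parameters x and return
    (accuracy, training_seconds, inference_memory_mb). Replace with a
    real training run; the toy formulas below just make the demo runnable."""
    acc = 0.9 - 0.01 * abs(x[1] + 2.5)
    time_s = 2.0 ** x[0] * 0.5
    mem_mb = x[2] * 1.5
    return acc, time_s, mem_mb

def overall_performance(x, acc_floor=0.85, w_time=0.3, w_mem=0.3):
    """Composite score (assumed form): reject configs below an accuracy floor,
    otherwise trade normalized training time and memory against accuracy."""
    acc, time_s, mem_mb = train_and_measure(x)
    if acc < acc_floor:
        return -1.0  # violates the high-accuracy restriction
    return acc - w_time * time_s / 200.0 - w_mem * mem_mb / 200.0

def expected_improvement(gp, X_cand, y_best):
    """EI acquisition for maximization over candidate points."""
    mu, sigma = gp.predict(X_cand, return_std=True)
    sigma = np.maximum(sigma, 1e-9)
    z = (mu - y_best) / sigma
    return (mu - y_best) * norm.cdf(z) + sigma * norm.pdf(z)

def sample(n):
    return rng.uniform(BOUNDS[:, 0], BOUNDS[:, 1], size=(n, len(BOUNDS)))

# Bayesian-optimization loop: fit a GP surrogate to the observed scores,
# then evaluate the candidate with the highest expected improvement.
X = sample(5)
y = np.array([overall_performance(x) for x in X])
gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), normalize_y=True)
for _ in range(20):
    gp.fit(X, y)
    cand = sample(256)
    x_next = cand[np.argmax(expected_improvement(gp, cand, y.max()))]
    X = np.vstack([X, x_next])
    y = np.append(y, overall_performance(x_next))

best = X[np.argmax(y)]
print("best hyper-parameters:", best, "score:", y.max())
```

The hard accuracy floor in `overall_performance` mirrors the abstract's "restriction of high inference accuracy"; any score shaping that enforces that restriction while rewarding faster training and smaller memory would fit the same loop.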