International Conference on Robot Intelligence Technology and Applications

An Empirical Study on the Optimal Batch Size for the Deep Q-Network

Abstract

We empirically find the optimal batch size for training the Deep Q-Network on the cart-pole system. Training efficiency is evaluated by the performance of the trained network on the task and by the total time and number of steps required for training. The network is trained with 10 different batch sizes while all other hyperparameter values are held fixed. The network is able to carry out the cart-pole task with a probability of 0.99 or higher for batch sizes from 8 to 2048. As the batch size increases, the training time per step tends to increase linearly, while the total number of training steps decreases more than exponentially. Owing to these tendencies, we empirically observe a convex, quadratic relationship between the total training time and the logarithm of the batch size, from which the optimal batch size that minimizes training time can be found. Both the total number of steps and the total training time are minimized at a batch size of 64. This result can be extended to other learning algorithms or tasks, and it motivates further theoretical analysis, from an optimization point of view, of the relationship between batch size (or other hyperparameters) and training efficiency.
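As a rough illustration of the convex trend described above, the total training time can be modeled as a quadratic in the logarithm of the batch size. The coefficients a, b, c below are hypothetical fit parameters, not values reported in the paper; the paper only reports that the minimum is observed near a batch size of 64.

```latex
% Illustrative model only: total training time T as a quadratic in \log_2 of batch size B.
% a, b, c are hypothetical fit coefficients; the minimizer B^* follows from setting the derivative to zero.
T(B) \approx a\,(\log_2 B)^2 + b\,\log_2 B + c, \quad a > 0,
\qquad B^{*} = \arg\min_B T(B) = 2^{-b/(2a)}
```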
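The sketch below shows, in Python, the kind of batch-size sweep the abstract describes, assuming PyTorch and Gymnasium's CartPole-v1. The network architecture, hyperparameters, and the stopping criterion (average return of at least 475 over the last 20 episodes, standing in for the paper's 0.99 success probability) are illustrative assumptions, not the authors' exact setup.

```python
# Minimal sketch of a DQN batch-size sweep on CartPole (illustrative, not the paper's code).
import random
import time
from collections import deque

import gymnasium as gym
import numpy as np
import torch
import torch.nn as nn


def make_qnet(obs_dim: int, n_actions: int) -> nn.Module:
    # Small MLP Q-network; the paper's exact architecture is not specified here.
    return nn.Sequential(
        nn.Linear(obs_dim, 64), nn.ReLU(),
        nn.Linear(64, 64), nn.ReLU(),
        nn.Linear(64, n_actions),
    )


def train_dqn(batch_size: int, max_steps: int = 50_000, gamma: float = 0.99):
    """Train a DQN on CartPole with one batch size; return (total steps, wall-clock seconds)."""
    env = gym.make("CartPole-v1")
    qnet = make_qnet(env.observation_space.shape[0], env.action_space.n)
    target = make_qnet(env.observation_space.shape[0], env.action_space.n)
    target.load_state_dict(qnet.state_dict())
    opt = torch.optim.Adam(qnet.parameters(), lr=1e-3)
    buffer = deque(maxlen=10_000)

    obs, _ = env.reset()
    episode_return = 0.0
    recent_returns = deque(maxlen=20)
    start = time.time()

    for step in range(1, max_steps + 1):
        # Linearly decaying epsilon-greedy exploration (illustrative schedule).
        eps = max(0.05, 1.0 - step / 10_000)
        if random.random() < eps:
            action = env.action_space.sample()
        else:
            with torch.no_grad():
                action = qnet(torch.as_tensor(obs, dtype=torch.float32)).argmax().item()

        next_obs, reward, terminated, truncated, _ = env.step(action)
        buffer.append((obs, action, reward, next_obs, float(terminated)))
        episode_return += reward
        obs = next_obs
        if terminated or truncated:
            recent_returns.append(episode_return)
            episode_return = 0.0
            obs, _ = env.reset()

        # One gradient update per environment step, using the swept batch size.
        if len(buffer) >= batch_size:
            batch = random.sample(buffer, batch_size)
            o, a, r, o2, d = (torch.as_tensor(np.array(x), dtype=torch.float32)
                              for x in zip(*batch))
            q = qnet(o).gather(1, a.long().unsqueeze(1)).squeeze(1)
            with torch.no_grad():
                q_next = target(o2).max(1).values
            loss = nn.functional.mse_loss(q, r + gamma * (1.0 - d) * q_next)
            opt.zero_grad()
            loss.backward()
            opt.step()

        # Periodic hard update of the target network.
        if step % 500 == 0:
            target.load_state_dict(qnet.state_dict())

        # Stop once the recent average return is near the CartPole-v1 maximum of 500.
        if (len(recent_returns) == recent_returns.maxlen
                and sum(recent_returns) / len(recent_returns) >= 475):
            break

    env.close()
    return step, time.time() - start


if __name__ == "__main__":
    # Batch-size sweep; the abstract reports the minimum total time and steps near 64.
    for bs in (8, 16, 32, 64, 128, 256, 512, 1024, 2048):
        steps, seconds = train_dqn(bs)
        print(f"batch={bs:5d}  steps={steps:6d}  time={seconds:7.1f}s  "
              f"time/step={seconds / steps:.4f}s")
```

Plotting the printed total time against the logarithm of the batch size is the natural way to check for the convex, roughly quadratic shape the abstract reports.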
