International Conference on Robot Intelligence Technology and Applications

An Empirical Study on the Optimal Batch Size for the Deep Q-Network



Abstract

We empirically find the optimal batch size for training the Deep Q-Network on the cart-pole system. Training efficiency is evaluated by the network's post-training performance on the task and by the total time and number of steps required for training. The network is trained with 10 different batch sizes while the other hyperparameter values are held fixed. With batch sizes from 8 to 2048, the trained network carries out the cart-pole task with a probability of 0.99 or higher. As the batch size increases, the training time per step tends to grow linearly, while the total number of training steps decreases faster than exponentially. Owing to these tendencies, we empirically observe a convex, quadratic relationship between the total training time and the logarithm of the batch size, from which the batch size that minimizes training time can be found. Both the total steps and the total time for training are minimized at a batch size of 64. This result may extend to other learning algorithms and tasks, and it further motivates a theoretical analysis, from an optimization point of view, of the relationship between the batch size (or other hyperparameters) and training efficiency.
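The procedure the abstract describes, fitting a quadratic to total training time as a function of the logarithm of the batch size and reading the optimum off the vertex, can be sketched as follows. The timing values below are illustrative placeholders shaped to match the reported trend (convex in log2 of batch size, minimum near 64), not measurements from the paper.

```python
import numpy as np

# Batch sizes swept in the study (8 to 2048) and hypothetical total
# training times in seconds; the real values would come from timing
# each training run to convergence.
batch_sizes = np.array([8, 16, 32, 64, 128, 256, 512, 1024, 2048])
total_time = np.array([310.0, 220.0, 170.0, 150.0, 165.0,
                       200.0, 260.0, 340.0, 450.0])

# Fit total_time ~ a*x^2 + b*x + c, where x = log2(batch size).
x = np.log2(batch_sizes)
a, b, c = np.polyfit(x, total_time, deg=2)

# The parabola's vertex gives the log2 of the time-minimizing batch size;
# round to the nearest power of two actually tested.
x_opt = -b / (2 * a)
optimal_batch = 2 ** int(round(float(x_opt)))
print(optimal_batch)  # → 64 for this illustrative data
```

With real measurements, the same vertex computation locates the time-optimal batch size without exhaustively timing sizes between the tested powers of two.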
