
Accelerating Deep Q Network by Weighting Experiences


Abstract

Deep Q Network (DQN) is a reinforcement learning methodology that uses deep neural networks to approximate the Q-function. The literature reveals that DQN can select better responses than humans. However, DQN requires a lengthy period of time to learn appropriate actions from tuples of state, action, reward, and next state, called "experiences", sampled from its memory. DQN samples experiences uniformly at random, but their distribution is skewed: frequent experiences are redundantly sampled while infrequent ones are rarely drawn, which slows learning. This work mitigates the problem by weighting experiences based on their frequency and manipulating their sampling probability. In a video game environment, the proposed method learned appropriate responses faster than DQN.
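The abstract does not spell out the weighting rule, but the core idea of biasing replay sampling away from over-represented experiences can be sketched in Python. This is a minimal illustration only, assuming an inverse-frequency weight over a discretized state; the class name WeightedReplayBuffer, the _key discretization, and the 1/count rule are hypothetical choices, not the paper's actual method.

# Illustrative sketch, not the paper's implementation: one plausible reading
# of "weighting experiences based on their frequency" is to make an
# experience's sampling probability inversely proportional to how often its
# state has been seen. All names below are assumptions for illustration.
import random
from collections import deque

class WeightedReplayBuffer:
    def __init__(self, capacity=10_000):
        self.buffer = deque(maxlen=capacity)  # (state, action, reward, next_state) tuples
        self.counts = {}                      # occurrence count per discretized state

    def add(self, state, action, reward, next_state):
        key = self._key(state)
        self.counts[key] = self.counts.get(key, 0) + 1
        self.buffer.append((state, action, reward, next_state))

    def sample(self, batch_size):
        # Frequent experiences get low weight, infrequent ones high weight,
        # so rare but informative transitions are replayed more often than
        # under the uniform sampling used by plain DQN.
        experiences = list(self.buffer)
        weights = [1.0 / self.counts[self._key(s)] for s, _, _, _ in experiences]
        return random.choices(experiences, weights=weights, k=batch_size)

    def _key(self, state):
        # Hypothetical discretization so raw continuous states can be counted.
        return tuple(round(x, 1) for x in state)

In a DQN training loop, add(...) would be called after each environment step and sample(batch_size) when forming the minibatch for the TD-target update, replacing the uniform draw from replay memory.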
