Deep Reinforcement Learning in Continuous Action Spaces: a Case Study in the Game of Simulated Curling

Kyowoon Lee; Sol-A Kim; Jaesik Choi; Seong-Whan Lee

Abstract

Many real-world applications of reinforcement learning require an agent to select optimal actions from continuous spaces. Recently, deep neural networks have been applied successfully to games with discrete action spaces. However, deep neural networks designed for discrete actions are not suitable for devising strategies in games where a very small change in an action can dramatically affect the outcome. In this paper, we present a new self-play reinforcement learning framework that combines a search algorithm for continuous action spaces with a kernel regression method. Without any hand-crafted features, our network is trained by supervised learning followed by self-play reinforcement learning, using a high-fidelity simulator for the Olympic sport of curling. The program trained under our framework outperforms existing programs equipped with several hand-crafted features, and it won an international digital curling competition.
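The abstract names kernel regression as the ingredient that lets search operate over continuous actions, but gives no details. As a rough illustration only, the sketch below shows a generic Nadaraya-Watson kernel regression that estimates the value of an unvisited action from nearby sampled actions. The Gaussian kernel, the bandwidth, the two-dimensional action encoding, and all function names are assumptions made for this example; they are not taken from the paper.

```python
import math


def gaussian_kernel(a, b, bandwidth):
    """Gaussian similarity between two actions given as equal-length tuples."""
    sq_dist = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def kernel_regression_value(query, actions, values, bandwidth=0.5):
    """Nadaraya-Watson estimate of the value at `query`.

    `actions` are previously sampled continuous actions and `values` are the
    outcomes observed for them (both hypothetical inputs for this sketch).
    """
    weights = [gaussian_kernel(query, a, bandwidth) for a in actions]
    total = sum(weights)
    if total == 0.0:
        # Every sample is too far away to carry weight; return a neutral value.
        return 0.0
    return sum(w * v for w, v in zip(weights, values)) / total


# Hypothetical samples: two actions with opposite outcomes.
sampled_actions = [(0.0, 0.0), (1.0, 0.0)]
sampled_values = [1.0, 0.0]

# A query halfway between the samples receives equal weight from both.
midpoint_estimate = kernel_regression_value((0.5, 0.0), sampled_actions, sampled_values)
```

The point of the sketch is that information gathered for one action is shared with its neighbours, so an action that was never tried still gets a value estimate, weighted toward the closest samples.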

Bibliographic record

Source
JMLR: Workshop and Conference Proceedings | 2018, issue 2010 | 10 pages
Authors
Kyowoon Lee; Sol-A Kim; Jaesik Choi; Seong-Whan Lee
Original format: PDF
CLC classification: Artificial intelligence theory
Similar literature

1. Automatic ship collision avoidance using deep reinforcement learning with LSTM in continuous action spaces [J]. Sawada Ryohei, Sato Keiji, Majima Takahiro. Journal of marine science and technology. 2021, No. 2
2. Energy management of hybrid electric bus based on deep reinforcement learning in continuous state and action space [J]. Tan Huachun, Zhang Hailong, Peng Jiankun, Energy Conversion & Management. 2019, No. SEPa
3. Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces [C]. Haotian Fu, Hongyao Tang, Jianye Hao, International Joint Conference on Artificial Intelligence. 2020
4. On Deep Reinforcement Learning for Games: Generalization of Deep Q-Learning with Multiple Policy Heads [D]. Boucher, Mathieu. 2020
5. Action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning [O]. JoonBum Leem, Ha Young Kim, Baogui Xin, 2020
6. Goal-Oriented Obstacle Avoidance with Deep Reinforcement Learning in Continuous Action Space [O]. Reinis Cimurs, Jin Han Lee, Il Hong Suh. 2020
