Quantum-Enhanced Reinforcement Learning for Finite-Episode Games with Discrete State Spaces

Neukart Florian; Von Dollen David; Seidel Christian; Compostella Gabriele

首页> 外文期刊>Frontiers in Physics >Quantum-Enhanced Reinforcement Learning for Finite-Episode Games with Discrete State Spaces

【24h】

Quantum-Enhanced Reinforcement Learning for Finite-Episode Games with Discrete State Spaces

机译：具有离散状态空间的有限事件游戏的量子增强强化学习

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Quantum annealing algorithms belong to the class of metaheuristic tools, applicable for solving binary optimization problems. Hardware implementations of quantum annealing, such as the quantum annealing machines produced by D-Wave Systems, have been subject to multiple analyses in research, with the aim of characterizing the technology's usefulness for optimization and sampling tasks. Here, we present a way to partially embed both Monte Carlo policy iteration for finding an optimal policy on random observations, as well as how to embed n sub-optimal state-value functions for approximating an improved state-value function given a policy for finite horizon games with discrete state spaces on a D-Wave 2000Q quantum processing unit (QPU). We explain how both problems can be expressed as a quadratic unconstrained binary optimization (QUBO) problem, and show that quantum-enhanced Monte Carlo policy evaluation allows for finding equivalent or better state-value functions for a given policy with the same number episodes compared to a purely classical Monte Carlo algorithm. Additionally, we describe a quantum-classical policy learning algorithm. Our first and foremost aim is to explain how to represent and solve parts of these problems with the help of the QPU, and not to prove supremacy over every existing classical policy evaluation algorithm.

机译：量子退火算法属于元启发式工具类别，适用于解决二进制优化问题。量子退火的硬件实现，例如D-Wave Systems生产的量子退火机，已经在研究中进行了多种分析，目的是表征该技术对优化和采样任务的有用性。在这里，我们提出了一种部分嵌入蒙特卡洛策略迭代以在随机观测中找到最佳策略的方法，以及在给定有限策略的情况下如何嵌入n次优状态值函数以逼近改进的状态值函数的方法D-Wave 2000Q量子处理单元（QPU）上具有离散状态空间的水平游戏。我们解释了如何将两个问题都表示为二次无约束二进制优化（QUBO）问题，并证明了量子增强的蒙特卡洛策略评估可为给定的策略找到相同或更好的状态值函数，而与纯经典的蒙特卡洛算法。此外，我们描述了一种量子古典策略学习算法。我们的首要目标是解释如何借助QPU来表示和解决这些问题的某些部分，而不是证明对每个现有的经典策略评估算法都具有至高无上的地位。

著录项

来源
《Frontiers in Physics》 |2017年第9期|共页
作者
Neukart Florian; Von Dollen David; Seidel Christian; Compostella Gabriele;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类物理学;
关键词
Quantum AnnealingQuantum computingreinforcement learningQuantum-enhanced algorithmsQuantum-classical;

机译：量子退火量子计算强化学习量子增强算法量子经典;

相似文献

外文文献
中文文献
专利

1. Markov-game modeling of cyclist-pedestrian interactions in shared spaces: A multi-agent adversarial inverse reinforcement learning approach [J] . Alsaleh Rushdi, Sayed Tarek Transportation research . 2021,第Jula期

机译：广播空间中骑自行车者行人互动的马尔可夫 - 游戏模型
2. Deep Reinforcement Learning in Continuous Action Spaces: a Case Study in the Game of Simulated Curling [J] . Kyowoon Lee, Sol-A Kim, Jaesik Choi, JMLR: Workshop and Conference Proceedings . 2018,第2010期

机译：连续动作空间中的深度强化学习：以模拟冰壶游戏为例
3. Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces [J] . Sean R. Sinclair, Siddhartha Banerjee, Christina Lee Yu Performance evaluation review . 2020,第1期

机译：公原空间中的焦化加固学习的自适应离散化
4. Binary Black-Box Attacks Against Static Malware Detectors with Reinforcement Learning in Discrete Action Spaces [C] . Mohammadreza Ebrahimi, Jason Pacheco, Weifeng Li, IEEE Security and Privacy Workshops . 2021

机译：二进制黑匣子攻击静态恶意软件探测器，在离散动作空间中的加固学习
5. On Deep Reinforcement Learning for Games: Generalization of Deep Q-Learning with Multiple Policy Heads [D] . Boucher, Mathieu. 2020

机译：关于游戏的深度加固学习：多重政策头部深度Q学的泛化
6. Action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning [O] . JoonBum Leem, Ha Young Kim, Baogui Xin, 2020

机译：采用深度加固学习采用延长离散动作空间的行动专业专业专家集合交易系统
7. Quantum-enhanced reinforcement learning for finite-episode games with discrete state spaces [O] . Neukart, Florian, Von Dollen, David, Seidel, Christian, 2017

机译：量子强化强化学习有限集数游戏离散状态空间

Quantum-Enhanced Reinforcement Learning for Finite-Episode Games with Discrete State Spaces

摘要

著录项

相似文献

相关主题

期刊订阅