首页> 外文期刊>Frontiers in Physics >Quantum-Enhanced Reinforcement Learning for Finite-Episode Games with Discrete State Spaces
【24h】

Quantum-Enhanced Reinforcement Learning for Finite-Episode Games with Discrete State Spaces

机译:具有离散状态空间的有限事件游戏的量子增强强化学习

获取原文
           

摘要

Quantum annealing algorithms belong to the class of metaheuristic tools, applicable for solving binary optimization problems. Hardware implementations of quantum annealing, such as the quantum annealing machines produced by D-Wave Systems, have been subject to multiple analyses in research, with the aim of characterizing the technology's usefulness for optimization and sampling tasks. Here, we present a way to partially embed both Monte Carlo policy iteration for finding an optimal policy on random observations, as well as how to embed n sub-optimal state-value functions for approximating an improved state-value function given a policy for finite horizon games with discrete state spaces on a D-Wave 2000Q quantum processing unit (QPU). We explain how both problems can be expressed as a quadratic unconstrained binary optimization (QUBO) problem, and show that quantum-enhanced Monte Carlo policy evaluation allows for finding equivalent or better state-value functions for a given policy with the same number episodes compared to a purely classical Monte Carlo algorithm. Additionally, we describe a quantum-classical policy learning algorithm. Our first and foremost aim is to explain how to represent and solve parts of these problems with the help of the QPU, and not to prove supremacy over every existing classical policy evaluation algorithm.
机译:量子退火算法属于元启发式工具类别,适用于解决二进制优化问题。量子退火的硬件实现,例如D-Wave Systems生产的量子退火机,已经在研究中进行了多种分析,目的是表征该技术对优化和采样任务的有用性。在这里,我们提出了一种部分嵌入蒙特卡洛策略迭代以在随机观测中找到最佳策略的方法,以及在给定有限策略的情况下如何嵌入n次优状态值函数以逼近改进的状态值函数的方法D-Wave 2000Q量子处理单元(QPU)上具有离散状态空间的水平游戏。我们解释了如何将两个问题都表示为二次无约束二进制优化(QUBO)问题,并证明了量子增强的蒙特卡洛策略评估可为给定的策略找到相同或更好的状态值函数,而与纯经典的蒙特卡洛算法。此外,我们描述了一种量子古典策略学习算法。我们的首要目标是解释如何借助QPU来表示和解决这些问题的某些部分,而不是证明对每个现有的经典策略评估算法都具有至高无上的地位。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号