...
首页> 外文期刊>Automatic Control, IEEE Transactions on >Whittle Index for Partially Observed Binary Markov Decision Processes
【24h】

Whittle Index for Partially Observed Binary Markov Decision Processes

机译:部分观测的二进制Markov决策过程的Whittle指数

获取原文
获取原文并翻译 | 示例
           

摘要

We consider the problem of dynamically scheduling M out of N binary Markov chains when only noisy observations of state are available, with ergodic (equivalently, long run average) reward. By passing on to the equivalent problem of controlling the conditional distribution of state given observations and controls, it is cast as a restless bandit problem and its Whittle indexability is established.
机译:当只有状态的嘈杂观测值可用时,我们考虑动态调度N个二元马尔可夫链中的M个问题,并获得遍历(等效于长期平均)奖励。通过将给定的观测和控制转移到控制状态的条件分布的等效问题,将其转换为不安定的土匪问题,并建立了其Whittle可分性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号