...
首页> 外文期刊>Journal of Economic Dynamics and Control >Valuing Programs With Deterministic And Stochastic Cycles
【24h】

Valuing Programs With Deterministic And Stochastic Cycles

机译:评估具有确定性和随机周期的程序

获取原文
获取原文并翻译 | 示例
           

摘要

In many dynamic programming problems, a mix of state variables exists - some exhibiting stochastic cycles and others having deterministic cycles. We derive a formula for the value function in infinite-horizon, stationary, Markovian decision problems by exploiting a special partitioned-circulant structure of the transition matrix Π. Our strategy for computing the left-inverse of the matrix [I-βΠ], which is central to implementing Howard's policy iteration algorithm, yields significant improvements in computation time and major reductions in memory required. When the deterministic cycle is of order n, our cyclic inversion algorithm yields an O(n~2) speed-up relative to the usual policy iteration algorithm.
机译:在许多动态编程问题中,存在状态变量的混合-一些状态变量显示为随机周期,其他状态变量为确定性周期。我们通过利用转移矩阵Π的特殊划分-循环结构,推导了无限水平,平稳,马尔可夫决策问题中的值函数公式。我们用于计算矩阵[I-βΠ]的左逆的策略(对于实施霍华德的策略迭代算法至关重要),可显着改善计算时间,并显着减少所需的内存。当确定性周期为n阶时,相对于常规策略迭代算法,我们的循环反演算法会产生O(n〜2)加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号