...
首页> 外文期刊>Applied mathematics and optimization >On Risk-Sensitive Piecewise Deterministic Markov Decision Processes
【24h】

On Risk-Sensitive Piecewise Deterministic Markov Decision Processes

机译:风险敏感分段确定型马尔可夫决策过程

获取原文
获取原文并翻译 | 示例
           

摘要

We consider a piecewise deterministic Markov decision process, where the expected exponential utility of total (nonnegative) cost is to be minimized. The cost rate, transition rate and post-jump distributions are under control. The state space is Borel, and the transition and cost rates are locally integrable along the drift. Under natural conditions, we establish the optimality equation, justify the value iteration algorithm, and show the existence of a deterministic stationary optimal policy. Applied to special cases, the obtained results already significantly improve some existing results in the literature on finite horizon and infinite horizon discounted risk-sensitive continuous-time Markov decision processes.
机译:我们考虑一个分段确定的马尔可夫决策过程,其中总共(非负)成本的预期指数效用是最小化的。 成本率,过渡率和跳跃后分布都处于控制状态。 状态空间是硼,并且过渡和成本速率沿漂移局部可集成。 在自然条件下,我们建立了最优性方程,证明了价值迭代算法,展示了确定性静止最佳政策的存在。 适用于特殊情况,所获得的结果已经显着改善了有限地平线和无限地平线折扣风险敏感连续时间马尔可夫决策过程的文献中的一些现有结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号