首页> 外文会议>Brazilian Conference on Intelligent Systems >Extreme Risk Averse Policy for Goal-Directed Risk-Sensitive Markov Decision Process
【24h】

Extreme Risk Averse Policy for Goal-Directed Risk-Sensitive Markov Decision Process

机译:目标导向的风险敏感马尔可夫决策过程的极端风险规避策略

获取原文

摘要

The Goal-Directed Risk-Sensitive Markov Decision Process allows arbitrary risk attitudes for the probabilistic planning problem to reach a goal state. In this problem, the risk attitude is modeled by an expected exponential utility and a risk factor λ. However, the problem is not well defined for every λ, posing the problem of defining the maximum (extreme) value for this factor. In this paper, we propose an algorithm to find this e-extreme risk factor and the corresponding optimal policy.
机译:目标导向的风险敏感马尔可夫决策过程允许概率计划问题的任意风险态度达到目标状态。在这个问题中,风险态度由预期的指数效用和风险因子λ建模。但是,对于每个λ,问题并未得到很好的定义,从而带来了为此因子定义最大值(极值)的问题。在本文中,我们提出了一种算法来找到该电子极端风险因素和相应的最优策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号