【24h】

Probabilistic Planning with Nonlinear Utility Functions

机译:非线性效用函数的概率规划

获取原文
获取原文并翻译 | 示例

摘要

Researchers often express probabilistic planning problems as Markov decision process models and then maximize the expected total reward. However, it is often rational to maximize the expected utility of the total reward for a given nonlinear utility function, for example, to model attitudes towards risk in high-stake decision situations. In this paper, we give an overview of basic techniques for probabilistic planning with nonlinear utility functions, including functional value iteration and a backward induction method for one-switch utility functions.
机译:研究人员通常将概率性计划问题表达为马尔可夫决策过程模型,然后使预期的总回报最大化。但是,对于给定的非线性效用函数,最大化总报酬的预期效用通常是合理的,例如,在高风险决策情况下对风险的态度建模。在本文中,我们概述了具有非线性效用函数的概率规划的基本技术,包括函数值迭代和单开关效用函数的后向归纳方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号