首页> 外文会议>International Conference on Automated Planning and Scheduling(ICAPS 2006); 2006; >Solving Factored MDPs with Exponential-Family Transition Models
【24h】

Solving Factored MDPs with Exponential-Family Transition Models

机译:用指数-家庭转移模型求解因式分解的MDP

获取原文
获取原文并翻译 | 示例

摘要

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea of the approach is to approximate the optimal value function by a linear combination of basis functions and optimize it by linear programming. In this paper, we extend the existing HALP paradigm beyond the mixture of beta transition model. As a consequence, we permit other transition functions, such as normals, without approximating them. Moreover, we identify a large class of basis functions that match these transition models and yield an efficient solution to the expectation terms in HALP. Finally, we apply the generalized HALP framework to solve a rover planning problem, which involves continuous time and resource uncertainty.
机译:具有离散和连续状态和动作分量的马尔可夫决策过程(MDP)可以通过混合近似线性规划(HALP)有效地解决。该方法的主要思想是通过基函数的线性组合来逼近最佳值函数,并通过线性编程对其进行优化。在本文中,我们将现有的HALP范式扩展到β过渡模型的混合之外。因此,我们允许其他过渡功能(例如法线)不近似。此外,我们确定了与这些过渡模型匹配的一大类基础函数,并为HALP中的期望项提供了有效的解决方案。最后,我们应用广义的HALP框架来解决漫游者计划问题,该问题涉及连续的时间和资源不确定性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号