Classical Planning in MDP Heuristics: With a Little Help from Generalization

机译：MDP启发式中的经典计划：在归纳法的帮助下

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Heuristic functions make MDP solvers practical by reducing their time and memory requirements. Some of the most effective heuristics (e.g., the FF heuristic function) first deter-minize the MDP and then solve a relaxation of the resulting classical planning problem (e.g., by ignoring delete effects). While these heuristic functions are fast to compute, they frequently yield overly optimistic value estimates. It is natural to wonder, then, whether the improved estimates of using a full classical planner on the (non-relaxed) determinized domain will provide enough gains to compensate for the vastly increased cost of computation.rnThis paper shows that the answer is "No and Yes". If one uses a full classical planner in the obvious way, the cost of the heuristic function's computation outweighs the benefits. However, we show that one can make the idea practical by generalizing the results of classical planning successes and failures. Specifically, we introduce a novel heuristic function called GOTH that amortizes the cost of classical planning by 1) extracting basis functions from the plans discovered during heuristic computation, 2) using these basis functions to generalize the heuristic value of one state to cover many others, and 3) thus invoking the classical planner many fewer times than there are states. Experiments show that GOTH can provide vast time and memory savings compared to the FF heuristic function - especially on large problems.

机译：启发式功能通过减少MDP求解器的时间和内存需求使其实用。一些最有效的启发式方法（例如FF启发式功能）首先确定MDP，然后解决所产生的经典计划问题的松弛（例如，通过忽略删除效果）。尽管这些启发式函数可以快速计算，但它们经常会产生过于乐观的价值估算。那么，很自然地想知道，在（非松弛）确定域上使用完整经典规划器的改进估计值是否可以提供足够的收益来补偿大幅增加的计算成本。rn本文表明答案是“是的”。如果人们以明显的方式使用完整的经典计划器，则启发式函数的计算成本将超过收益。但是，我们表明，通过概括经典计划的成功和失败的结果，可以使该想法可行。具体来说，我们引入一种称为GOTH的新颖启发式函数，该函数通过以下方式摊销经典计划的成本：1）从启发式计算过程中发现的计划中提取基础函数，2）使用这些基础函数概括一个状态的启发式值以涵盖许多其他状态，（3）因此，调用经典计划程序的次数比状态调用次数少。实验表明，与FF启发式功能相比，GOTH可以节省大量时间和内存-特别是在大问题上。

著录项

来源
《Proceedings of the Twentieth international conference on automated planning and scheduling》|2010年|P.97-104|共8页
会议地点 Toronto(CA);Toronto(CA)
作者
Andrey Kolobov; Mausam; Daniel S. Weld;
展开▼
作者单位

Dept of Computer Science and Engineering University of Washington, Seattle WA-98195;

rnDept of Computer Science and Engineering University of Washington, Seattle WA-98195;

rnDept of Computer Science and Engineering University of Washington, Seattle WA-98195;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类管理计划和控制;
关键词

相似文献

外文文献
中文文献
专利

1. Extending Classical Planning with State Constraints: Heuristics and Search for Optimal Planning [J] . Patrik Haslum, Franc Ivankovic, Miquel Ramirez, The Journal of Artificial Intelligence Research . 2018,第8期

机译：在国家约束下扩展经典计划：启发式方法和最佳计划搜索
2. Extending Classical Planning with State Constraints: Heuristics and Search for Optimal Planning [J] . Haslum Patrik, Ivankovic Franc, Ramirez Miquel, The Journal of Artificial Intelligence Research . 2018,第期

机译：通过国家限制扩展古典规划：启发式和搜索最佳规划
3. FluCaP: A Heuristic Search Planner for First-Order MDPs [J] . Hoelldobler S., Karabaev E., Skvortsova O. The Journal of Artificial Intelligence Research . 2006,第12期

机译：FluCaP：一阶MDP的启发式搜索计划器
4. Classical Planning in MDP Heuristics: With a Little Help from Generalization [C] . Andrey Kolobov, Mausam, Daniel S. Weld International conference on automated planning and scheduling . 2010

机译：MDP启发式中的古典规划：泛化有点帮助
5. Planning under uncertainty: From informative path planning to partially observable semi-MDPs [D] . Wei, Lim Zhan. 2015

机译：不确定性下的计划：从信息路径计划到部分可观察的半MDP
6. Context/Resource-Aware Mission Planning Based on BNs and Concurrent MDPs for Autonomous UAVs [O] . Chabha Hireche, Catherine Dezan, Stéphane Mocanu, 2018

机译：基于BN和并发MDP的自主无人机的上下文/资源感知任务计划
7. FluCaP: A Heuristic Search Planner for First-Order MDPs [O] . Hoelldobler, S., Karabaev, E., Skvortsova, O. 2011

机译：FluCap：一阶mDp的启发式搜索计划程序

Classical Planning in MDP Heuristics: With a Little Help from Generalization

摘要

著录项

相似文献

相关主题

期刊订阅