首页> 外文会议>International Conference on Automated Planning and Scheduling(ICAPS 2006); 2006; >Learning Depth-First Search: A Unified Approach to Heuristic Search in Deterministic and Non-Deterministic Settings, and its application to MDPs
【24h】

Learning Depth-First Search: A Unified Approach to Heuristic Search in Deterministic and Non-Deterministic Settings, and its application to MDPs

机译:学习深度优先搜索:确定性和非确定性环境中启发式搜索的统一方法及其在MDP中的应用

获取原文
获取原文并翻译 | 示例

摘要

Dynamic Programming provides a convenient and unified framework for studying many state models used in AI but no algorithms for handling large spaces. Heuristic-search methods, on the other hand, can handle large spaces but lack a common foundation. In this work, we combine the benefits of a general dynamic programming formulation with the power of heuristic-search techniques for developing an algorithmic framework, that we call Learning Depth-First Search, that aims to be both general and effective. LDFS is a simple piece of code that performs iterated depth-first searches enhanced with learning. For deterministic actions and monotone value functions, LDFS reduces to IDA~* with transposition tables, while for Game Trees, to the state-of-the-art iterated Alpha-Beta search algorithm with Null Windows known as MTD. For other models, like AND/OR graphs and MDPs, LDFS yields new, simple, and competitive algorithms. We show this here for MDPs.
机译:动态编程为研究AI中使用的许多状态模型提供了一个方便且统一的框架,但没有用于处理大空间的算法。另一方面,启发式搜索方法可以处理较大的空间,但缺乏通用的基础。在这项工作中,我们将通用动态规划公式化的好处与启发式搜索技术的力量相结合,以开发一种算法框架,我们将其称为学习深度优先搜索,旨在既通用又有效。 LDFS是一段简单的代码,它可以执行迭代的深度优先搜索,从而增强了学习能力。对于确定性动作和单调值函数,LDFS通过换位表简化为IDA〜*,而对于Game Trees,则简化为具有Null Windows的最新迭代Alpha-Beta搜索算法,称为MTD。对于其他模型,例如AND / OR图和MDP,LDFS产生了新的,简单的和有竞争力的算法。我们在这里为MDP展示此内容。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号