Learning Depth-First Search: A Unified Approach to Heuristic Search in Deterministic and Non-Deterministic Settings, and its application to MDPs


Abstract

Dynamic Programming provides a convenient and unified framework for studying many state models used in AI but no algorithms for handling large spaces. Heuristic-search methods, on the other hand, can handle large spaces but lack a common foundation. In this work, we combine the benefits of a general dynamic programming formulation with the power of heuristic-search techniques for developing an algorithmic framework, that we call Learning Depth-First Search, that aims to be both general and effective. LDFS is a simple piece of code that performs iterated depth-first searches enhanced with learning. For deterministic actions and monotone value functions, LDFS reduces to IDA* with transposition tables, while for Game Trees, to the state-of-the-art iterated Alpha-Beta search algorithm with Null Windows known as MTD. For other models, like AND/OR graphs and MDPs, LDFS yields new, simple, and competitive algorithms. We show this here for MDPs.
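The phrase "iterated depth-first searches enhanced with learning" can be illustrated with a small sketch. The code below is not the authors' implementation; it is a minimal reading of the idea under stated assumptions: a finite, acyclic MDP with non-negative action costs, zero cost at terminal states, and an initial value function of zero (a lower bound on the optimal cost). Each depth-first pass follows only actions that are greedy with respect to the current values; when a state has no consistent greedy action, its value is raised by a Bellman update (the "learning" step) and the pass is repeated. All names (`MODEL`, `q_value`, `ldfs`, `ldfs_driver`) and the example model are hypothetical.

```python
from typing import Dict, Set, Tuple

# Hypothetical model layout: MODEL[state][action] = (cost, {successor: probability}).
# A state with no actions is treated as terminal with cost 0 (an assumption).
Model = Dict[str, Dict[str, Tuple[float, Dict[str, float]]]]

MODEL: Model = {
    "s0": {"a": (1.0, {"s1": 0.5, "s2": 0.5}), "b": (5.0, {"g": 1.0})},
    "s1": {"c": (1.0, {"g": 1.0})},
    "s2": {"d": (2.0, {"g": 1.0})},
    "g": {},
}

EPS = 1e-9  # tolerance for floating-point comparisons


def q_value(model: Model, V: Dict[str, float], s: str, a: str) -> float:
    """Expected cost of taking action a in s and then following values V."""
    cost, outcomes = model[s][a]
    return cost + sum(p * V[t] for t, p in outcomes.items())


def ldfs(model: Model, V: Dict[str, float], policy: Dict[str, str],
         solved: Set[str], s: str) -> bool:
    """One depth-first pass from s; returns True if s is solved afterwards."""
    if s in solved:
        return True
    if not model[s]:  # terminal state
        V[s] = 0.0
        solved.add(s)
        return True
    flag = False
    for a in model[s]:
        if q_value(model, V, s, a) > V[s] + EPS:
            continue  # a is not greedy with respect to the current V
        flag = True
        for t in model[s][a][1]:
            # Successors must be solved, and a must stay greedy after
            # any value updates made deeper in the search.
            flag = (ldfs(model, V, policy, solved, t)
                    and q_value(model, V, s, a) <= V[s] + EPS)
            if not flag:
                break
        if flag:
            policy[s] = a
            break
    if flag:
        solved.add(s)
    else:
        # Learning step: Bellman update raises the value of s.
        V[s] = min(q_value(model, V, s, a) for a in model[s])
    return flag


def ldfs_driver(model: Model, s0: str):
    """Repeat depth-first passes from s0 until s0 is solved."""
    V = {s: 0.0 for s in model}  # zero initial values (assumed lower bound)
    policy: Dict[str, str] = {}
    solved: Set[str] = set()
    while not ldfs(model, V, policy, solved, s0):
        pass
    return V, policy


V, policy = ldfs_driver(MODEL, "s0")
print(V["s0"], policy["s0"])  # 2.5 a
```

In this toy model, action `a` at `s0` has expected cost 1 + 0.5·1 + 0.5·2 = 2.5, which beats the direct action `b` at cost 5. Models with cycles would need additional machinery that this sketch does not attempt.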

Bibliographic record

Source
International Conference on Automated Planning and Scheduling (ICAPS 2006), 2006, pp. 142-151 (10 pages)
Authors
Blai Bonet; Hector Geffner
Author affiliation

Departamento de Computacion, Universidad Simon Bolivar, Caracas, Venezuela

Original format: PDF
Language: English
Chinese Library Classification: N12

Similar literature

1. FluCaP: A Heuristic Search Planner for First-Order MDPs [J]. Hoelldobler S., Karabaev E., Skvortsova O. The Journal of Artificial Intelligence Research. 2006, No. 12
2. FluCaP: A Heuristic Search Planner for First-Order MDPs [J]. S. Hoelldobler, E. Karabaev, O. Skvortsova. Journal of Automation, Mobile Robotics & Intelligent Systems. 2006, No. 5
3. Comparison of Decisions Quality of Heuristic Methods with Limited Depth-First Search Techniques in the Graph Shortest Path Problem [J]. Eduard Vatutin. Open Engineering. 2017, No. 1
4. Depth-First Proof-Number Search with Heuristic Edge Cost and Application to Chemical Synthesis Planning [C]. Akihiro Kishimoto, Beat Buesser, Bei Chen. Conference on Neural Information Processing Systems. 2020
5. A case-based reasoning and inductive learning approach for heuristic search. [D]. Krovvidy, Srinivas. 1992
6. MDPs with Non-Deterministic Policies [O]. Mahdi Milani Fard, Joelle Pineau
7. FluCaP: A Heuristic Search Planner for First-Order MDPs [O]. Hoelldobler, S., Karabaev, E., Skvortsova, O. 2011
