Approximating Value Trees in Structured Dynamic Programming

Abstract

We propose and examine a method of approximate dynamic programming for Markov decision processes based on structured problem representations. We assume an MDP is represented using a dynamic Bayesian network, and construct value functions using decision trees as our function representation. The size of the representation is kept within acceptable limits by pruning these value trees so that leaves represent possible ranges of values, thus approximating the value functions produced during optimization. We propose a method for detecting convergence, prove error bounds on the resulting approximately optimal value functions and policies, and describe some preliminary experimental results.
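
The pruning idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the names `Leaf`, `Node`, `value_range`, `prune`, and the `tolerance` parameter are hypothetical, state variables are assumed to be boolean, and the pruning rule (collapse any subtree whose values span at most `tolerance`) is an assumption chosen for illustration.

```python
from dataclasses import dataclass
from typing import Tuple, Union


@dataclass(frozen=True)
class Leaf:
    lo: float  # smallest value of any state that reaches this leaf
    hi: float  # largest value of any state that reaches this leaf


@dataclass(frozen=True)
class Node:
    var: str          # boolean state variable tested at this node (assumed)
    if_true: "Tree"   # subtree for states where var is true
    if_false: "Tree"  # subtree for states where var is false


Tree = Union[Leaf, Node]


def value_range(tree: Tree) -> Tuple[float, float]:
    """Return the (min, max) value over all leaves below this tree."""
    if isinstance(tree, Leaf):
        return tree.lo, tree.hi
    lo_t, hi_t = value_range(tree.if_true)
    lo_f, hi_f = value_range(tree.if_false)
    return min(lo_t, lo_f), max(hi_t, hi_f)


def prune(tree: Tree, tolerance: float) -> Tree:
    """Bottom-up: replace a subtree by one range leaf if its span <= tolerance."""
    if isinstance(tree, Leaf):
        return tree
    pruned = Node(
        tree.var,
        prune(tree.if_true, tolerance),
        prune(tree.if_false, tolerance),
    )
    lo, hi = value_range(pruned)
    if hi - lo <= tolerance:
        return Leaf(lo, hi)  # the leaf now stands for a range of values
    return pruned


def count_leaves(tree: Tree) -> int:
    """Number of leaves, used here as a simple measure of tree size."""
    if isinstance(tree, Leaf):
        return 1
    return count_leaves(tree.if_true) + count_leaves(tree.if_false)


# Exact value tree with three leaves (lo == hi at every leaf).
exact = Node("A", Node("B", Leaf(10.0, 10.0), Leaf(9.5, 9.5)), Leaf(0.0, 0.0))
approx = prune(exact, tolerance=1.0)
```

In this sketch the subtree under `B` spans only 0.5, so it collapses into the single leaf `Leaf(9.5, 10.0)`, while the root spans 10.0 and is kept. Reporting the midpoint of a range leaf would misstate any state's value by at most half the leaf's span, which is the kind of quantity an error bound would need to track.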

Bibliographic record

Source: Machine learning | 1996 | pp. 54-62 | 9 pages
Conference location: Bari (IT)
Authors: Craig Boutilier; Richard Dearden
Author affiliation: Department of Computer Science, University of British Columbia, Vancouver, BC V6T 1Z4, Canada (both authors)
Original format: PDF
Language: English
Chinese Library Classification: Computer applications

Similar literature

1. Towards Programmable Network Dynamics, Wi-Fi/WiGig Coordination For Optimal WiGig, Computational Complexity, Incommutability Of The Generalized Capacity, Fixed-Parameter Approximability Of Boolean Min Csps, Game Theoretic Analysis Of Tree Based Referrals [J]. K.N.P. Kumar. Advances in Physics Theories and Applications. 2013, No. 3
2. Network-Level Infrastructure Management Using Approximate Dynamic Programming [J]. Kenneth D. Kuhn. Journal of Infrastructure Systems. 2010, No. 2
3. Efficient enumeration of stereoisomers of tree structured molecules using dynamic programming [J]. Tomoki Imada, Shunsuke Ota, Hiroshi Nagamochi. Journal of Mathematical Chemistry. 2011, No. 4
4. Approximate dynamic programming solutions of multi-agent graphical games using actor-critic network structures [C]. Abouheaf Mohammed I., Lewis Frank L. International Joint Conference on Neural Networks. 2013
5. Optimal Stochastic Scheduling of Restoration of Infrastructure Systems from Hazards: An Approximate Dynamic Programming Approach [D]. Nozhati, Saeed. 2019
6. Integer programming-based method for grammar-based tree compression and its application to pattern extraction of glycan tree structures [O]. Yang Zhao, Morihiro Hayashida, Tatsuya Akutsu. 2010
7. Efficient enumeration of stereoisomers of tree structured molecules using dynamic programming [O]. Imada Tomoki, Ota Shunsuke, Nagamochi Hiroshi. 2011
