A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications

Warren B.POWELL

首页> 中文期刊> 《控制理论与应用：英文版》 >A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications

A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

We review the literature on approximate dynamic programming,with the goal of better understanding the theory behind practical algorithms for solving dynamic programs with continuous and vector-valued states and actions and complex information processes.We build on the literature that has addressed the well-known problem of multidimensional(and possibly continuous) states,and the extensive literature on model-free dynamic programming,which also assumes that the expectation in Bellman's equation cannot be computed.However,we point out complications that arise when the actions/controls are vector-valued and possibly continuous.We then describe some recent research by the authors on approximate policy iteration algorithms that offer convergence guarantees(with technical assumptions) for both parametric and nonparametric architectures for the value function.

著录项

来源
《控制理论与应用：英文版》 |2011年第3期|336-352|共17页
作者
Warren B.POWELL;
展开▼
作者单位

Department of Operations Research and Financial Engineering;

Princeton University;

展开▼
原文格式 PDF
正文语种 chi
中图分类傅里叶分析（经典调和分析）;自动控制理论;
关键词
近似动态编程; 加强学习; 最佳的控制; 近似算法;

A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications

摘要

著录项

相关主题

期刊订阅