What Makes Some POMDP Problems Easy to Approximate?

机译：是什么使某些POMDP问题易于估计？

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Point-based algorithms have been surprisingly successful in computing approximately optimal solutions for partially observable Markov decision processes (POMDPs) in high dimensional belief spaces. In this work, we seek to understand the belief-space properties that allow some POMDP problems to be approximated efficiently and thus help to explain the point-based algorithms' success often observed in the experiments. We show that an approximately optimal POMDP solution can be computed in time polynomial in the covering number of a reachable belief space, which is the subset of the belief space reachable from a given belief point. We also show that under the weaker condition of having a small covering number for an optimal reachable space, which is the subset of the belief space reachable under an optimal policy, computing an approximately optimal solution is NP-hard. However, given a suitable set of points that "cover" an optimal reachable space well, an approximate solution can be computed in polynomial time. The covering number highlights several interesting properties that reduce the complexity of POMDP planning in practice, e.g., fully observed state variables, beliefs with sparse support, smooth beliefs, and circulant state-transition matrices.

机译：基于点的算法在为高维置信空间中的部分可观察的马尔可夫决策过程（POMDP）计算近似最佳解决方案方面取得了令人惊讶的成功。在这项工作中，我们试图了解使某些POMDP问题得到有效逼近的置信空间属性，从而有助于解释在实验中经常观察到的基于点的算法的成功。我们表明，可以在可到达的置信空间的覆盖数中以时间多项式计算近似最佳的POMDP解，该覆盖数是从给定的置信点可到达的置信空间的子集。我们还表明，在较弱的条件下，即对于最佳可到达空间，其覆盖数较小（这是在最佳策略下可到达的信念空间的子集），计算近似最佳解是NP-hard的。但是，给定一组适当的点，这些点“很好地”覆盖了最佳的可到达空间，则可以在多项式时间内计算出近似解。涵盖的数字突出显示了一些有趣的属性，这些属性在实践中降低了POMDP规划的复杂性，例如，充分观察到的状态变量，稀疏支持的信念，平滑的信念和循环状态转换矩阵。

著录项

来源
《Annual Conference on Neural Information Processing Systems》|2007年|281-289|共8页
会议地点
作者
David Hsu; Wee Sun Lee; Nan Rong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词
入库时间 2022-08-26 14:56:36

相似文献

外文文献
中文文献
专利

1. Approximate Planning in POMDPs with Weighted Graph Models [J] . Liu Yong, Lu Xingjia, Makedon Fillia International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms . 2015,第4期

机译：具有加权图模型的POMDP中的近似规划
2. Scheduling sensors for monitoring sentient spaces using an approximate POMDP policy [J] . Ronen Vaisenberg, Alessio Della Motta, Sharad Mehrotra, Pervasive and Mobile Computing . 2014,第Pta1期

机译：使用近似的POMDP策略调度传感器以监视感知空间
3. Monte Carlo Sampling Methods for Approximating Interactive POMDPs [J] . Doshi P., Gmytrasiewicz P. J The Journal of Artificial Intelligence Research . 2009,第4期

机译：近似交互式POMDP的蒙特卡洛采样方法
4. What Makes Some POMDP Problems Easy to Approximate? [C] . Annual Conference on Neural Information Processing Systems . 2007

机译：是什么让一些POMDP问题易于近似？
5. Phase transitions and typical-case complexity: Easy (hard) aspects of hard (easy) problems. [D] . Gao, Yong. 2005

机译：相变和典型案例的复杂性：困难（容易）问题的容易（困难）方面。
6. Forward and Backward Bellman Equations Improve the Efficiency of the EM Algorithm for DEC-POMDP [O] . Takehiro Tottori, Tetsuya J. Kobayashi 2021

机译：向前和后退Bellman方程提高了DEC-POMDP的EM算法的效率
7. Monte Carlo Sampling Methods for Approximating Interactive POMDPs [O] . Doshi, Prashant, Gmytrasiewicz, Piotr J. 2014

机译：用于逼近交互式pOmDp的蒙特卡罗采样方法

What Makes Some POMDP Problems Easy to Approximate?

摘要

著录项

相似文献

相关主题

期刊订阅