A Probabilistic Greedy Search Value Iteration Algorithm for POMDP

Abstract

Point-based value iteration methods are a class of effective algorithms for solving POMDP models. Although MDP-based algorithms such as FSVI can greatly reduce complexity and improve efficiency by using the optimal policy of the underlying MDP, their excessive randomness makes them unsuitable for realistic POMDP problems. This paper presents a probabilistic greedy search value iteration algorithm (PGSVI). PGSVI selects an action according to the weighted reward, explores the state for the next horizon in a probabilistic greedy manner based on the belief state and the transition function, and then samples an observation from those observations whose probability exceeds a threshold. By selecting more rational actions, states, and observations during exploration, PGSVI compensates for the shortcomings of FSVI while preserving efficiency. Experimental results on four benchmarks show that PGSVI is very competitive with FSVI on POMDP problems with a large number of observations.
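The three selection steps named in the abstract can be illustrated with a rough Python sketch of a single exploration step. The abstract does not define the exact rules, so the following are assumptions made for illustration only and not the authors' definitions: the "weighted reward" is taken as the belief-weighted immediate reward, "probabilistic greedy" is taken to mean choosing the most likely next state with probability `greedy_prob` and sampling otherwise, and the names `explore_step`, `threshold`, and `greedy_prob` are hypothetical.

```python
import random


def explore_step(belief, T, O, R, threshold=0.1, greedy_prob=0.8, rng=random):
    """One forward exploration step (illustrative sketch, not the paper's code).

    belief[s]   : probability of being in state s
    T[s][a][s2] : probability of moving from s to s2 under action a
    O[s2][a][o] : probability of observing o after reaching s2 via a
    R[s][a]     : immediate reward for taking a in s
    """
    n_states, n_actions = len(belief), len(R[0])

    # 1. Action: maximize the belief-weighted immediate reward (assumed weighting).
    action = max(range(n_actions),
                 key=lambda a: sum(belief[s] * R[s][a] for s in range(n_states)))

    # 2. Next state: predicted distribution from the belief and transition function.
    pred = [sum(belief[s] * T[s][action][s2] for s in range(n_states))
            for s2 in range(n_states)]
    if rng.random() < greedy_prob:
        # Greedy branch: take the most likely next state.
        next_state = max(range(n_states), key=pred.__getitem__)
    else:
        # Exploratory branch: sample from the predicted distribution.
        next_state = rng.choices(range(n_states), weights=pred)[0]

    # 3. Observation: sample only among observations above the threshold.
    obs_probs = O[next_state][action]
    candidates = [o for o, p in enumerate(obs_probs) if p > threshold]
    if not candidates:
        # Assumed fallback: use the single most likely observation.
        candidates = [max(range(len(obs_probs)), key=obs_probs.__getitem__)]
    observation = rng.choices(candidates,
                              weights=[obs_probs[o] for o in candidates])[0]

    # 4. Standard Bayesian belief update for the chosen action and observation.
    unnorm = [O[s2][action][observation] * pred[s2] for s2 in range(n_states)]
    total = sum(unnorm)
    next_belief = [u / total for u in unnorm] if total > 0 else list(belief)
    return action, next_state, observation, next_belief
```

Repeating this step from an initial belief would produce a trajectory of belief points, which a point-based method could then use for value backups. The backup itself is outside the scope of this sketch.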

Bibliographic record

Source
2016, pp. 926-929 (4 pages)
Authors
Feng Liu; Zebang Song
Original format
PDF
Keywords
Algorithm design and analysis; Approximation algorithms; Probabilistic logic; Complexity theory; Benchmark testing; Trajectory; Space exploration

Similar literature

1. Iterated-greedy-based algorithms with beam search initialization for the permutation flowshop to minimise total tardiness [J]. Fernandez-Viagas Victor, Valente Jorge M. S., Framinan Jose M. Expert Systems with Applications. 2018, Issue MARa
2. Using iterated greedy and randomized iterated greedy algorithms to solve urban area waste collection in Riyadh city [J]. Abdulwahab Almutairi. International Journal of Advanced Statistics and Probability. 2020, Issue 1
3. Music Structure Analysis Using a Probabilistic Fitness Measure and a Greedy Search Algorithm [J]. Paulus J., Klapuri A. Audio, Speech, and Language Processing, IEEE Transactions on. 2009, Issue 6
4. A Probabilistic Greedy Search Value Iteration Algorithm for POMDP [C]. Feng Liu, Zebang Song. International Conference on Tools with Artificial Intelligence. 2016
5. Probabilistic Trans-Algorithmic Search [D]. Gonen, Bilal. 2011
6. A Modified Sine-Cosine Algorithm Based on Neighborhood Search and Greedy Levy Mutation [O]. Chiwen Qu, Zhiliu Zeng, Jun Dai. 2018
7. Iterated-greedy-based algorithms with beam search initialization for the permutation flowshop to minimise total tardiness [O]. Victor Fernandez-Viagas, Jorge M.S. Valente, Jose M. Framinan. 2018
