CONVERGENCE PROPERTIES OF A COMPUTATIONAL LEARNING MODEL FOR UNKNOWN MARKOV CHAINS

Abstract

The increasing complexity of engineering systems has motivated continuing research on computational learning methods for building autonomous intelligent systems that learn to improve their performance over time while interacting with their environment. Such systems must not only sense their environment but also integrate information from it into all decision making. The evolution of such a system is modeled as an unknown controlled Markov chain. In previous research, the predictive optimal decision-making (POD) model was developed to learn, in real time, the unknown transition probabilities and associated costs over a varying finite time horizon. In this paper, the convergence of POD to the stationary distribution of a Markov chain is proven, thus establishing POD as a robust model for building autonomous intelligent systems. The paper provides the conditions under which POD is valid and an interpretation of its underlying structure.
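As a rough illustration of the two ideas named in the abstract (learning unknown transition probabilities from observed transitions, and convergence toward a stationary distribution), the following sketch may help. It is not the POD model, whose details are not given on this page: it uses plain transition counting and power iteration, and the 3-state matrix `P_TRUE`, the function names, and the step count are all assumptions chosen only for illustration.

```python
import random

def simulate_chain(P, start, steps, rng):
    """Sample a trajectory from a finite Markov chain with transition matrix P."""
    states = list(range(len(P)))
    path = [start]
    for _ in range(steps):
        # Draw the next state using the row of the current state as weights.
        path.append(rng.choices(states, weights=P[path[-1]])[0])
    return path

def estimate_transitions(path, n_states):
    """Estimate transition probabilities as normalized transition counts."""
    counts = [[0] * n_states for _ in range(n_states)]
    for s, t in zip(path, path[1:]):
        counts[s][t] += 1
    est = []
    for row in counts:
        total = sum(row)
        # States never visited fall back to a uniform row (an arbitrary choice).
        est.append([c / total for c in row] if total else [1.0 / n_states] * n_states)
    return est

def stationary(P, iters=1000):
    """Approximate the stationary distribution by power iteration."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    return pi

# Hypothetical 3-state chain; treated as unknown by the estimator.
P_TRUE = [[0.5, 0.3, 0.2],
          [0.2, 0.6, 0.2],
          [0.1, 0.3, 0.6]]

rng = random.Random(0)
path = simulate_chain(P_TRUE, start=0, steps=50_000, rng=rng)
P_HAT = estimate_transitions(path, 3)   # learned from observations only
PI_TRUE = stationary(P_TRUE)            # stationary distribution of the true chain
PI_HAT = stationary(P_HAT)              # stationary distribution of the estimate
```

With a long enough trajectory, `P_HAT` should approach `P_TRUE` row by row, and `PI_HAT` should approach `PI_TRUE`. The sketch has no control actions or costs, both of which the abstract says POD handles.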

Bibliographic record

Source: ASME Dynamic Systems and Control Conference, 2008, 8 pages
Author: Andreas A. Malikopoulos
Original format: PDF
Chinese Library Classification: T-53

Similar literature

1. Convergence Properties of a Computational Learning Model for Unknown Markov Chains [J]. Andreas A. Malikopoulos. Journal of Dynamic Systems, Measurement, and Control. 2009, No. 4
2. Switching diffusion logistic models involving singularly perturbed Markov chains: Weak convergence and stochastic permanence [J]. Li Xiaoyue, Yin George. Stochastic Analysis and Applications. 2017, No. 2
3. CONVERGENCE PROPERTIES OF A COMPUTATIONAL LEARNING MODEL FOR UNKNOWN MARKOV CHAINS [C]. ASME dynamic systems and control conference. 2009
4. Bayesian Inference for High Dimensional Models: Convergence Properties and Computational Issues [D]. Banerjee, Sayantan. 2014
5. Performance of Markov Chain–Monte Carlo Approaches for Mapping Genes in Oligogenic Models with an Unknown Number of Loci [O]. Jae K. Lee, Duncan C. Thomas. 2000
6. Convergence properties of perturbed Markov chains [O]. Gareth O. Roberts, Jeffrey S. Rosenthal, Peter O. Schwartz. 1998
7. Convergence Properties of Continuous-Time Markov Chains with Application to Target Search [R]. Jun, M., Jeffcoat, D. E. 2005
