Sample-path average optimality for Markov control processes

Lasserre J.B.

首页> 外文期刊>IEEE Transactions on Automatic Control >Sample-path average optimality for Markov control processes

【24h】

Sample-path average optimality for Markov control processes

机译：马尔可夫控制过程的样本路径平均最优

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The authors consider a Markov control process with Borel state and actions spaces, unbounded costs, and under the long-run sample-path average cost criterion. They prove that under very weak assumptions on the transition law and a moment assumption for the one-step cost, there exists a stationary policy with invariant probability distribution v, that is sample-path average cost optimal for v-almost all initial states. In addition, every expected average-cost optimal stationary policy is in fact (liminf) sample-path average-cost optimal and strongly expected average-cost optimal.

机译：作者考虑了具有Borel状态和动作空间，无限制成本以及长期样本路径平均成本准则的Markov控制过程。他们证明，在关于过渡律的非常弱的假设和单步成本的瞬时假设的情况下，存在一个具有不变概率分布v的平稳策略，即对于v几乎所有初始状态而言最优的样本路径平均成本。另外，每个期望的平均成本最优平稳策略实际上是（liminf）样本路径平均成本最优和强烈期望的平均成本最优。

著录项

来源
《IEEE Transactions on Automatic Control》 |1999年第10期|P.1966-1971|共6页
作者
Lasserre J.B.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化系统;
关键词

相似文献

外文文献
中文文献
专利

1. Sample-path average optimality for Markov control processes [J] . Lasserre J.B. IEEE Transactions on Automatic Control . 1999,第10期

机译：马尔可夫控制过程的样本路径平均最优
2. Optimal control of ergodic continuous-time Markov chains with average sample-path rewards [J] . Guo XP, Cao XR SIAM Journal on Control and Optimization . 2005,第1期

机译：具有平均样本路径奖励的遍历连续时间马尔可夫链的最优控制
3. A Counterexample on Sample-Path Optimality in Stable Markov Decision Chains with the Average Reward Criterion [J] . Rolando Cavazos-Cadena, Raúl Montes-de-Oca, Karel Sladky Journal of Optimization Theory and Applications . 2014,第2期

机译：具有平均奖励标准的稳定马尔可夫决策链中样本路径最优的反例
4. Sample-path and variance minimization of Markov control processes with average cost criteria [C] . Hernandez-Lerma, O., Vega-Amaya, Decision and Control, 2000. Proceedings of the 39th IEEE Conference on . 2000

机译：具有平均成本标准的马尔可夫控制过程的样本路径和方差最小化
5. Controlled Markov chains with risk-sensitive average cost criterion. [D] . Brau Rojas, Agustin. 1999

机译：具有风险敏感平均成本准则的受控马尔可夫链。
6. A Markovian Approach towards Bacterial Size Control and Homeostasis in Anomalous Growth Processes [O] . Yanyan Chen, Rosa Baños, Javier Buceta -1

机译：异常生长过程中细菌大小控制和体内平衡的马尔可夫方法
7. Sample-Path and Variance Minimization of Markov Control Processes with Average Cost Criteria [O] . Onésimo Hernández-Lerma, Oscar Vega-Amaya, Guadalupe Carrasco 2007

机译：具有平均成本标准的马尔可夫控制过程的样本路径和方差最小化
8. Discrete-Time Controlled Markov Processes With Average Cost Criterion: A Survey. [R] . Arapostathis, A., Borkar, V. S., Fernandez- Gaucherand, E., 1992

机译：具有平均成本标准的离散时间控制马尔可夫过程：一项调查。

Sample-path average optimality for Markov control processes

摘要

著录项

相似文献

相关主题

期刊订阅