...
首页> 外文期刊>IEEE Transactions on Automatic Control >Sample-path average optimality for Markov control processes
【24h】

Sample-path average optimality for Markov control processes

机译:马尔可夫控制过程的样本路径平均最优

获取原文
获取原文并翻译 | 示例
           

摘要

The authors consider a Markov control process with Borel state and actions spaces, unbounded costs, and under the long-run sample-path average cost criterion. They prove that under very weak assumptions on the transition law and a moment assumption for the one-step cost, there exists a stationary policy with invariant probability distribution v, that is sample-path average cost optimal for v-almost all initial states. In addition, every expected average-cost optimal stationary policy is in fact (liminf) sample-path average-cost optimal and strongly expected average-cost optimal.
机译:作者考虑了具有Borel状态和动作空间,无限制成本以及长期样本路径平均成本准则的Markov控制过程。他们证明,在关于过渡律的非常弱的假设和单步成本的瞬时假设的情况下,存在一个具有不变概率分布v的平稳策略,即对于v几乎所有初始状态而言最优的样本路径平均成本。另外,每个期望的平均成本最优平稳策略实际上是(liminf)样本路径平均成本最优和强烈期望的平均成本最优。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号