首页> 美国政府科技报告 >I. Criterion Equivalence in Discrete Dynamic Programming. II. Stochastic Games with Perfect Information and Time Average Payoff

【24h】

I. Criterion Equivalence in Discrete Dynamic Programming. II. Stochastic Games with Perfect Information and Time Average Payoff

机译：I.离散动态规划中的判据等价。 II。具有完美信息和时间平均收益的随机游戏

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is shown that for finite state and action space Markovian decision processes, a policy is 1-optimal if and only if it is average overtaking optimal so the two criteria are equivalent. A counterexample to an alleged extension of the Hardy-Littlewood Theorem is given, and the optimality of stationary strategies for stochastic games of perfect information with time average payoffs is established. (Author)

著录项

作者
Lippman, S. A.; Liggett, T. M.;
展开▼
作者单位

展开▼
年度 1968
页码 p.1-20
总页数 20
原文格式 PDF
正文语种 eng
中图分类
关键词
Dynamic programming ; Stochastic processes ; Game theory ; Theorems ; Optimization ; Decision theory;

机译：动态规划;随机过程;博弈论;定理;优化;决策理论;

相似文献

外文文献
中文文献
专利

1. Stochastic Games with Average Payoff Criterion [J] . M. K. Ghosh, A. Bagchi Applied mathematics and optimization . 1998,第3期

机译：具有平均收益标准的随机游戏
2. Approximation of two-person zero-sum continuous-time Markov games with average payoff criterion [J] . Lorenzo Jose Maria, Hernandez-Noriega Ismael, Prieto-Rumeau Tomas Operations Research Letters: A Journal of the Operations Research Society of America . 2015,第1期

机译：具有平均收益标准的两人零和连续时间马尔可夫游戏的逼近
3. Stochastic Games for Continuous-Time Jump Processes Under Finite-Horizon Payoff Criterion [J] . Wei Qingda, Chen Xian Applied mathematics and optimization . 2016,第2期

机译：有限地平线支付准则下用于连续时间跳跃过程的随机博弈
4. Subrecursive program schemata I amp; II(I. Undecidable equivalence problems, II. Decidable equivalence problems) [C] . Robert L. Constable, Steven S. Muchnick Annual ACM symposium on Theory of computing;ACM symposium on Theory of computing . 1972

机译：亚递归程序模式I和II（I。不可确定的等价问题，II。可确定的等价问题）
5. I. Fundamental practicum--fluorescence lifetime imaging: An approach for fuel equivalence ratio imaging. II. Industrial practicum--the interactions between ionic surfactants and divalent cations. III. Apprenticeship practicum--the relaxed and spectroscopic energies of olefin triplets. [D] . Ni, Tuqiang. 1990

机译：I.基本实践-荧光寿命成像：燃料当量比成像的一种方法。二。工业实践-离子表面活性剂与二价阳离子之间的相互作用三，学徒实习-烯烃三联体的弛豫和光谱能。
6. FUNCTIONAL EQUATIONS IN THE THEORY OF DYNAMIC PROGRAMMING. II. NONLINEAR DIFFERENTIAL EQUATIONS [O] . Richard Bellman 1955

机译：动态规划理论中的功能方程。二。非线性微分方程
7. Stochastic Games with Average Payoff Criterion [O] . Ghosh MK, Bagchi A 1998

机译：具有平均收益标准的随机游戏
8. Stochastic Games with Average Payoff Criterion [R] . Ghosh, M. K., Bagchi, A. 1991

机译：具有平均支付准则的随机游戏

I. Criterion Equivalence in Discrete Dynamic Programming. II. Stochastic Games with Perfect Information and Time Average Payoff

摘要

著录项

相似文献

相关主题

期刊订阅