Optimality of index policies for a sequential sampling problem

Castanon D.; Streltsov S.; Vakili P.

首页> 外文期刊>IEEE Transactions on Automatic Control >Optimality of index policies for a sequential sampling problem

【24h】

Optimality of index policies for a sequential sampling problem

机译：顺序抽样问题的索引策略的最优性

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Consider the following sequential sampling problem: at each time, a choice must be made between obtaining an independent sample from one of a set of random reward variables or stopping the sampling. Sampling a random variable incurs a random cost at each time. The objective of the problem is to maximize the expected nett difference between the largest sample reward obtained before stopping and the accumulated costs incurred while sampling. In this paper, the authors prove that the optimal feedback strategies for this problem are index policies and provide an explicit expression for the optimal expected reward from any state. The problem is motivated by search methods for global optimization problems where the cost of computation is explicitly incorporated into the objective

机译：考虑以下顺序抽样问题：每次必须在从一组随机奖励变量之一中获得独立样本或停止抽样之间做出选择。对随机变量进行采样每次都会产生随机成本。该问题的目的是使停止前获得的最大样本奖励与采样时产生的累计成本之间的预期净差最大。在本文中，作者证明了针对该问题的最佳反馈策略是索引策略，并为任何状态的最佳预期奖励提供了明确的表示。该问题是由针对全局优化问题的搜索方法引起的，其中，计算成本已明确纳入目标

著录项

来源
《IEEE Transactions on Automatic Control》 |1999年第1期|p.145-148|共4页
作者
Castanon D.; Streltsov S.; Vakili P.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化系统;
关键词
dynamic programming; random processes; sampling methods; search problems; dynamic programming; feedback; global optimization; optimal index policy; random cost; random variable; search methods; sequential sampling; statistical model;

机译：动态规划;随机过程;采样方法;搜索问题;动态规划;反馈;全局优化;最优索引策略;随机成本;随机变量;搜索方法;顺序采样;统计模型;

相似文献

外文文献
中文文献
专利

1. Optimality of index policies for a sequential sampling problem [J] . Castanon D., Streltsov S. IEEE Transactions on Automatic Control . 1999,第1期

机译：顺序抽样问题的索引策略的最优性
2. Optimal sequential sampling policy of partitioned random search and its approximation [J] . Tang ZB. Journal of Optimization Theory and Applications . 1998,第2期

机译：分区随机搜索的最佳顺序抽样策略及其逼近
3. Optimality, sample size, and power calculations for the sequential parallel comparison design. [J] . Ivanova A, Qaqish B, Schoenfeld DA Statistics in medicine . 2011,第23期

机译：顺序并行比较设计的最优性，样本量和功效计算。
4. Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path [C] . Andras Antos, Csaba Szepesvari, Remi Munos Annual Conference on Learning Theory(COLT 2006); 20060622-25; Pittsburgh,PA(US) . 2006

机译：通过基于Bellman-残差最小化的拟合策略迭代和单个样本路径学习近乎最优的策略
5. Optimal sample policies for multirate digital control. [D] . Narigon, Michael Lee. 1991

机译：用于多速率数字控制的最佳样本策略。
6. Heuristic and optimal policy computations in the human brain during sequential decision-making [O] . Christoph W. Korn, Dominik R. Bach -1

机译：人脑在顺序决策中的启发式和最佳策略计算
7. Sample-size-optimal Bayesian schemes in sequential sampling [O] . Biele, Jonathan 1990

机译：顺序采样中的样本大小最优贝叶斯方案
8. Stereo under Sequential Optimal Sampling: A Statistical Analysis Framework for Search Space Reduction (Open Access). [R] . Wang, Y., Wang, K., Dunn, E., 2014

机译：顺序最优采样下的立体声：搜索空间减少的统计分析框架（开放存取）。

Optimality of index policies for a sequential sampling problem

摘要

著录项

相似文献

相关主题

期刊订阅