Journal: IEEE Transactions on Automatic Control

Optimality of index policies for a sequential sampling problem

Abstract

Consider the following sequential sampling problem: at each time, a choice must be made between obtaining an independent sample from one of a set of random reward variables or stopping the sampling. Each sample of a random variable incurs a random cost. The objective is to maximize the expected net difference between the largest sample reward obtained before stopping and the accumulated costs incurred while sampling. In this paper, the authors prove that the optimal feedback strategies for this problem are index policies and provide an explicit expression for the optimal expected reward from any state. The problem is motivated by search methods for global optimization problems where the cost of computation is explicitly incorporated into the objective.
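
The abstract does not state the index formula, so the following is only an illustrative sketch of what an index policy for this kind of problem can look like, not the authors' construction. It assumes a reservation-value index: for each reward variable X with mean sampling cost c, the index is the value z solving E[max(X - z, 0)] = c. The sketch further assumes discrete reward distributions, charges the mean cost on every sample instead of drawing a random cost, and starts from a best reward of 0. The names reservation_index, run_index_policy, and the arms dictionary layout are hypothetical.

```python
import random


def reservation_index(values, probs, mean_cost, tol=1e-9):
    """Solve E[max(X - z, 0)] = mean_cost for z by bisection.

    Assumed index definition (reservation value); not taken from the paper.
    """
    if mean_cost <= 0:
        raise ValueError("mean_cost must be positive")

    def expected_gain(z):
        # Expected improvement over a current best of z from one more sample.
        return sum(p * max(v - z, 0.0) for v, p in zip(values, probs))

    # expected_gain is non-increasing in z; it is >= mean_cost at lo, 0 at hi.
    lo, hi = min(values) - mean_cost, max(values)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if expected_gain(mid) > mean_cost:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def run_index_policy(arms, rng, start=0.0):
    """Sample the highest-index variable until the best reward reaches its index.

    Samples are independent and the distributions do not change, so the
    indices are fixed and only the largest one matters in this sketch.
    Returns (best reward, total cost, number of samples).
    """
    indices = [
        reservation_index(a["values"], a["probs"], a["mean_cost"]) for a in arms
    ]
    i = max(range(len(arms)), key=indices.__getitem__)
    best, total_cost, n_samples = start, 0.0, 0
    while best < indices[i]:
        arm = arms[i]
        total_cost += arm["mean_cost"]  # simplification: charge the mean cost
        sample = rng.choices(arm["values"], weights=arm["probs"])[0]
        best = max(best, sample)
        n_samples += 1
    return best, total_cost, n_samples


if __name__ == "__main__":
    arms = [
        {"values": [0.0, 10.0], "probs": [0.5, 0.5], "mean_cost": 1.0},
        {"values": [5.0], "probs": [1.0], "mean_cost": 1.0},
    ]
    best, cost, n = run_index_policy(arms, random.Random(0))
    print("net reward:", best - cost, "after", n, "samples")
```

Under these assumptions, the stopping rule compares the largest reward seen so far with the largest index: sampling continues only while one more draw is expected to improve the best reward by more than it costs.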
