首页> 外文期刊>Electronic Journal of Statistics >Randomized allocation with arm elimination in a bandit problem with covariates
【24h】

Randomized allocation with arm elimination in a bandit problem with covariates

机译:带有协变量的匪徒问题中具有手臂消除的随机分配

获取原文
           

摘要

Motivated by applications in personalized web services and clinical research, we consider a multi-armed bandit problem in a setting where the mean reward of each arm is associated with some covariates. A multi-stage randomized allocation with arm elimination algorithm is proposed to combine the flexibility in reward function modeling and a theoretical guarantee of a cumulative regret minimax rate. When the function smoothness parameter is unknown, the algorithm is equipped with a histogram estimation based smoothness parameter selector using Lepski’s method, and is shown to maintain the regret minimax rate up to a logarithmic factor under a “self-similarity” condition.
机译:受个性化Web服务和临床研究中应用程序的激励,我们考虑在每个手臂的平均收益与某些协变量相关联的环境中的多臂强盗问题。提出了一种多阶段随机分配的手臂消除算法,结合了奖励函数建模的灵活性和累积后悔最小极大值率的理论保证。当函数平滑度参数未知时,该算法将配备使用Lepski方法的基于直方图估计的平滑度参数选择器,并显示在“自相似”条件下将后悔最小最大速率保持为对数因子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号