A non-parametric solution to the multi-armed bandit problem with covariates

Ai Mingyao; Huang Yimin; Yu Jun

首页> 外文期刊>European Journal of Medicinal Chemistry: Chimie Therapeutique >A non-parametric solution to the multi-armed bandit problem with covariates

【24h】

A non-parametric solution to the multi-armed bandit problem with covariates

机译：具有协变量的多武装强盗问题的非参数解决方案

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years, the multi-armed bandit problem regains popularity especially for the case with covariates since it has new applications in customized services such as personalized medicine. To deal with the bandit problem with covariates, a policy called binned subsample mean comparison that decomposes the original problem into some proper classic bandit problems is introduced. The growth rate in a setting that the reward of each arm depends on observable covariates is studied accordingly. When rewards follow an exponential family, it can be shown that the regret of the proposed method can achieve the nearly optimal growth rate. Simulations show that the proposed policy has the competitive performance compared with other policies. (C) 2020 Elsevier B.V. All rights reserved.

机译：近年来，多武装匪徒问题重新流行起来，尤其是对于具有协变量的情况，因为它在个性化医疗等定制服务中有了新的应用。为了解决带有协变量的bandit问题，引入了一种称为bined子样本均值比较的策略，将原始问题分解为一些适当的经典bandit问题。在每个手臂的奖励取决于可观测协变量的情况下，相应地研究了增长率。当报酬服从指数族时，可以证明所提出的方法可以获得接近最优的增长率。仿真结果表明，与其他策略相比，该策略具有更好的性能。（C） 2020爱思唯尔B.V.版权所有。

著录项

来源
《European Journal of Medicinal Chemistry: Chimie Therapeutique》 |2021年第1期|共12页
作者
Ai Mingyao; Huang Yimin; Yu Jun;
展开▼
作者单位

Peking Univ Sch Math Sci LMAM Beijing 100871 Peoples R China;

Peking Univ Sch Math Sci LMAM Beijing 100871 Peoples R China;

Beijing Inst Technol Sch Math &

Stat Beijing 100081 Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类药学;
关键词
Efficient policy; Multi-armed bandit problem; Nonparametric solution; Subsample comparisons;

机译：有效的政策;多武装土匪问题;非参数解;子样本比较;

相似文献

外文文献
中文文献
专利

1. Residential HVAC Aggregation Based on Risk-averse Multi-armed Bandit Learning for Secondary Frequency Regulation [J] . Xinyi Chen, Qinran Hu, Qingxin Shi, 现代电力系统与清洁能源学报(英文) . 2020,第006期
2. A non-parametric solution to the multi-armed bandit problem with covariates [J] . Ai Mingyao, Huang Yimin, Yu Jun Journal of Statistical Planning and Inference . 2021,第1期

机译：具有协变量的多武装强盗问题的非参数解决方案
3. The K-Nearest Neighbour UCB Algorithm for Multi-Armed Bandits with Covariates [J] . Henry Reeve, Joe Mellor, Gavin Brown JMLR: Workshop and Conference Proceedings . 2018,第12期

机译：具有协变量的多武装土匪的K最近邻UCB算法
4. Minimax Concave Penalized Multi-Armed Bandit Model with High-Dimensional Covariates [J] . Xue Wang, Mingcheng Wei, Tao Yao JMLR: Workshop and Conference Proceedings . 2018,第2010期

机译：Minimax禁止惩罚惩罚多武装强盗模型，具有高维协调因子
5. Simulation studies of Multi-Armed Bandits with Covariates [C] . Nicos G. Pavlidis, Dimitris K. Tasoulis, David J. Hand EUROSIM/UKSim . 2008

机译：协变者多武装匪徒的仿真研究
6. Offline Evaluation of Multi-Armed Bandit Algorithms Using Bootstrapped Replay on Expanded Data [D] . Dai, Jin. 2021

机译：在扩展数据上使用引导重播的多武装强盗算法的离线评估
7. Smoking and the bandit: A preliminary study of smoker and non-smoker differences in exploratory behavior measured with a multi-armed bandit task [O] . Merideth A. Addicott, John M. Pearson, Jessica Wilson, -1

机译：吸烟和强盗：用多武装强盗任务测量的探索性行为的吸烟者和非吸烟者差异的初步研究
8. MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS [O] . Dong Woo Kim, Tze Leung Lai, Huanzhong Xu 2020

机译：具有协变量的多武装匪徒：理论与应用
9. Learning in A Changing World: Non-Bayesian Restless Multi-Armed Bandit [R] . Liu, H., Liu, K., Zhao, Q. 2010

机译：在变化的世界中学习：非贝叶斯不安定的多武装强盗

A non-parametric solution to the multi-armed bandit problem with covariates

摘要

著录项

相似文献

相关主题

期刊订阅