Motivated by situations where extreme values, rather than the expected values of the classical stochastic multi-armed bandit (MAB) setting, are of interest, we propose a distribution-free algorithm for $\textit{extreme bandits}$ and characterize its statistical properties. The proposed algorithm is index based, where the index is constructed non-parametrically using combinatorics and robust statistics. For distributions with "exponential-like tails" and "polynomial-like tails", we establish the following results: (i) the proposed algorithm is consistent, i.e., the index corresponding to the best arm asymptotically attains the largest value; (ii) the proposed algorithm achieves vanishing extremal regret under weaker conditions than existing algorithms. Numerical experiments on the classes of distributions commonly considered in the extreme bandits literature highlight the superior finite-sample performance of the proposed algorithm compared to the state of the art.