مرکز منطقه ای اطلاع رساني علوم و فناوري - Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost

DocumentCode :

1437891

Title :

Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost

Author :

Agrawal, Rajeev ; Hedge, M.V. ; Teneketzis, Demosthenis

Author_Institution :

Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA

Volume :

Issue :

fYear :

1988

Firstpage :

899

Lastpage :

906

Abstract :

The authors consider multiarmed bandit problems with switching cost, define uniformly good allocation rules, and restrict attention to such rules. They present a lower bound on the asymptotic performance of uniformly good allocation rules and construct an allocation scheme that achieves the bound. It is found that despite the inclusion of a switching cost the proposed allocation scheme achieves the same asymptotic performance as the optimal rule for the bandit problem without switching cost. This is made possible by grouping together samples into blocks of increasing sizes, thereby reducing the number of switches to O(log n). Finally, an optimal allocation scheme for a large class of distributions which includes members of the exponential family is illustrated.<>

Keywords :

operations research; optimisation; adaptive allocation rules; multiarmed bandit problem; operations research; optimal allocation; switching cost; Cost function; Laboratories; Random variables; Resource management; Sampling methods; Signal processing; Statistics;

fLanguage :

English

Journal_Title :

Automatic Control, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9286

Type :

jour

DOI :

10.1109/9.7243

Filename :

7243

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1437891