DocumentCode :
138585
Title :
Best-arm identification algorithms for multi-armed bandits in the fixed confidence setting
Author :
Jamieson, Kevin ; Nowak, Robert
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Wisconsin - Madison, Madison, WI, USA
fYear :
2014
fDate :
19-21 March 2014
Firstpage :
1
Lastpage :
6
Abstract :
This paper is concerned with identifying the arm with the highest mean in a multi-armed bandit problem using as few independent samples from the arms as possible. While the so-called “best arm problem” dates back to the 1950s, only recently were two qualitatively different algorithms proposed that achieve the optimal sample complexity for the problem. This paper reviews these recent advances and shows that most best-arm algorithms can be described as variants of the two recent optimal algorithms. For each algorithm type we consider a specific instance to analyze both theoretically and empirically thereby exposing the core components of the theoretical analysis of these algorithms and intuition about how the algorithms work in practice. The derived sample complexity bounds are novel, and in certain cases improve upon previous bounds. In addition, we compare a variety of state-of-the-art algorithms empirically through simulations for the best-arm-problem.
Keywords :
computational complexity; cores; estimation theory; optimal systems; best arm problem; best-arm identification; core components; fixed confidence setting; independent samples; multiarmed bandit problem; optimal algorithms; optimal sample complexity; sample complexity bounds; Algorithm design and analysis; Complexity theory; Computers; Educational institutions; Electronic mail; Random variables;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Sciences and Systems (CISS), 2014 48th Annual Conference on
Conference_Location :
Princeton, NJ
Type :
conf
DOI :
10.1109/CISS.2014.6814096
Filename :
6814096
Link To Document :
بازگشت