DocumentCode
138585
Title
Best-arm identification algorithms for multi-armed bandits in the fixed confidence setting
Author
Jamieson, Kevin ; Nowak, Robert
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of Wisconsin - Madison, Madison, WI, USA
fYear
2014
fDate
19-21 March 2014
Firstpage
1
Lastpage
6
Abstract
This paper is concerned with identifying the arm with the highest mean in a multi-armed bandit problem using as few independent samples from the arms as possible. While the so-called “best arm problem” dates back to the 1950s, only recently were two qualitatively different algorithms proposed that achieve the optimal sample complexity for the problem. This paper reviews these recent advances and shows that most best-arm algorithms can be described as variants of the two recent optimal algorithms. For each algorithm type we consider a specific instance to analyze both theoretically and empirically thereby exposing the core components of the theoretical analysis of these algorithms and intuition about how the algorithms work in practice. The derived sample complexity bounds are novel, and in certain cases improve upon previous bounds. In addition, we compare a variety of state-of-the-art algorithms empirically through simulations for the best-arm-problem.
Keywords
computational complexity; cores; estimation theory; optimal systems; best arm problem; best-arm identification; core components; fixed confidence setting; independent samples; multiarmed bandit problem; optimal algorithms; optimal sample complexity; sample complexity bounds; Algorithm design and analysis; Complexity theory; Computers; Educational institutions; Electronic mail; Random variables;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Sciences and Systems (CISS), 2014 48th Annual Conference on
Conference_Location
Princeton, NJ
Type
conf
DOI
10.1109/CISS.2014.6814096
Filename
6814096
Link To Document