DocumentCode :
2200811
Title :
Decentralized multi-armed bandit with multiple distributed players
Author :
Liu, Keqin ; Zhao, Qing
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, Davis, CA, USA
fYear :
2010
fDate :
Jan. 31 2010-Feb. 5 2010
Firstpage :
1
Lastpage :
10
Abstract :
We formulate and study a decentralized multi-armed bandit (MAB) problem, where M distributed players compete for N independent arms with unknown reward statistics. At each time, each player chooses one arm to play without exchanging information with other players. Players choosing the same arm collide, and, depending on the collision model, either no one receives reward or the colliding players share the reward in an arbitrary way. We show that the minimum system regret of the decentralized MAB grows with time at the same logarithmic order as in the centralized counterpart where players act collectively as a single entity by exchanging observations and making decisions jointly. A general framework of constructing fair and order-optimal decentralized policies is established based on a Time Division Fair Sharing (TDFS) of the M best arms. A lower bound on the system regret growth rate is established for a general class of decentralized polices, to which all TDFS policies belong. We further develop several fair and order-optimal decentralized polices within the TDFS framework and study their performance in different applications including cognitive radio networks, multi-channel communications in unknown fading environment, target collecting in multi-agent systems, and web search and advertising.
Keywords :
cognitive radio; radio networks; Web search; cognitive radio networks; collision model; decentralized multiarmed bandit problem; lower bound; multiagent systems; multichannel communications; multiple distributed players; order-optimal decentralized policy; time division fair sharing; Advertising; Arm; Cognitive radio; Fading; History; Loss measurement; Multiagent systems; Performance loss; Statistical distributions; Web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory and Applications Workshop (ITA), 2010
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4244-7012-9
Electronic_ISBN :
978-1-4244-7014-3
Type :
conf
DOI :
10.1109/ITA.2010.5454071
Filename :
5454071
Link To Document :
بازگشت