مرکز منطقه ای اطلاع رساني علوم و فناوري - Bandit problems in networks: Asymptotically efficient distributed allocation rules

DocumentCode :

3431416

Title :

Bandit problems in networks: Asymptotically efficient distributed allocation rules

Author :

Kar, Soummya ; Poor, H. Vincent ; Cui, Shuguang

Author_Institution :

Dept. of Electrical Engineering, Princeton University, NJ 08544, USA

fYear :

2011

fDate :

12-15 Dec. 2011

Firstpage :

1771

Lastpage :

1778

Abstract :

This paper studies the multi-agent bandit problem in a distributed networked setting. The setting considered assumes only one bandit (the major bandit) has accessible reward information from its samples, whereas the rest (the minor bandits) have unobservable rewards. Under the assumption that the minor bandits are aware of the sampling pattern of the major bandit (but with no direct access to its rewards), a lower bound on the expected average network regret is obtained. The lower bound resembles the logarithmic optimal regret attained in single (classical) bandit problems, but in addition is shown to scale down with the number of agents. A collaborative and adaptive distributed allocation rule DA is proposed and is shown to achieve the lower bound on the expected average regret for a connected inter-bandit communication network. In particular, it is shown that under the DA allocation rule, the minor bandits attain sub-logarithmic expected regrets as opposed to logarithmic in the single agent setting.

Keywords :

Collaboration; Decision making; Density measurement; Random variables; Resource management; Symmetric matrices; Vectors; Asymptotically Efficient; Distributed Allocation Rules; Networked Bandit Problems; Partially Observable Rewards;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on

Conference_Location :

Orlando, FL, USA

ISSN :

0743-1546

Print_ISBN :

978-1-61284-800-6

Electronic_ISBN :

0743-1546

Type :

conf

DOI :

10.1109/CDC.2011.6160719

Filename :

6160719

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3431416