Title :
On the optimality of the Gittins index rule in multi-armed bandits with multiple plays
Author :
Pandelis, Dimitrios G. ; Teneketzis, Demosthenis
Author_Institution :
Environ. Res. Inst. of Michigan, Ann Arbor, MI, USA
Abstract :
Investigates the general multi-armed bandit problem with multiple servers. The authors determine a condition on the reward processes sufficient to guarantee the optimality of the strategy that operates at each instant of time the projects with the highest Gittins indices. The authors call this strategy the Gittins index rule for multi-armed bandits with multiple plays, or briefly the Gittins index rule. The authors show by examples that: (i) the aforementioned sufficient condition is not necessary for the optimality of the Gittins index rule; and (ii) when the sufficient condition is relaxed the Gittins index rule is not necessarily optimal. Finally, the authors present an application of the general results to the multiserver scheduling of parallel queues without arrivals
Keywords :
game theory; queueing theory; resource allocation; scheduling; Gittins index rule; Gittins indices; multi-armed bandits; multiple plays; multiserver scheduling; optimality; parallel queues; reward processes; sufficient condition; Costs; Infinite horizon; Resource management; Stochastic processes; Sufficient conditions;
Conference_Titel :
Decision and Control, 1995., Proceedings of the 34th IEEE Conference on
Conference_Location :
New Orleans, LA
Print_ISBN :
0-7803-2685-7
DOI :
10.1109/CDC.1995.480298