Title :
Finite dimensional algorithms for the hidden Markov model multi-armed bandit problem
Author :
Krishnamurthy, Vikram ; Mickova, Josipa
Author_Institution :
Dept. of Electr. Eng., Melbourne Univ., Parkville, Vic., Australia
Abstract :
The multi-arm bandit problem is widely used in scheduling of traffic in broadband networks, manufacturing systems and robotics. This paper presents a finite dimensional optimal solution to the multi-arm bandit problem for hidden Markov models. The key to solving any multi-arm bandit problem is to compute the Gittins (1979, 1989) index. In this paper a finite dimensional algorithm is presented which exactly computes the Gittins index. Suboptimal algorithms for computing the Gittins index are also presented and experimentally shown to perform almost as well as the optimal method. Finally an application of the algorithms to tracking multiple targets with a single intelligent sensor is presented
Keywords :
broadband networks; hidden Markov models; intelligent sensors; manufacture; robots; scheduling; target tracking; telecommunication traffic; Gittins index; HMM; broadband networks; finite dimensional algorithms; finite dimensional optimal solution; hidden Markov model; intelligent sensor; manufacturing systems; multi-armed bandit problem; multiple targets tracking; robotics; suboptimal algorithms; traffic scheduling; Broadband communication; Hidden Markov models; Intelligent sensors; Job shop scheduling; Manufacturing systems; Processor scheduling; Robots; Target tracking; Telecommunication traffic; Traffic control;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.761360