DocumentCode :
2759959
Title :
Finite dimensional algorithms for the hidden Markov model multi-armed bandit problem
Author :
Krishnamurthy, Vikram ; Mickova, Josipa
Author_Institution :
Dept. of Electr. Eng., Melbourne Univ., Parkville, Vic., Australia
Volume :
5
fYear :
1999
fDate :
1999
Firstpage :
2865
Abstract :
The multi-arm bandit problem is widely used in scheduling of traffic in broadband networks, manufacturing systems and robotics. This paper presents a finite dimensional optimal solution to the multi-arm bandit problem for hidden Markov models. The key to solving any multi-arm bandit problem is to compute the Gittins (1979, 1989) index. In this paper a finite dimensional algorithm is presented which exactly computes the Gittins index. Suboptimal algorithms for computing the Gittins index are also presented and experimentally shown to perform almost as well as the optimal method. Finally an application of the algorithms to tracking multiple targets with a single intelligent sensor is presented
Keywords :
broadband networks; hidden Markov models; intelligent sensors; manufacture; robots; scheduling; target tracking; telecommunication traffic; Gittins index; HMM; broadband networks; finite dimensional algorithms; finite dimensional optimal solution; hidden Markov model; intelligent sensor; manufacturing systems; multi-armed bandit problem; multiple targets tracking; robotics; suboptimal algorithms; traffic scheduling; Broadband communication; Hidden Markov models; Intelligent sensors; Job shop scheduling; Manufacturing systems; Processor scheduling; Robots; Target tracking; Telecommunication traffic; Traffic control;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.761360
Filename :
761360
Link To Document :
بازگشت