DocumentCode
2974479
Title
Asymptotically efficient adaptive allocation schemes for controlled Markov chains: finite parameter space
Author
Agrawal, Rajeev ; Teneketzis, Demosthenis ; Anantharam, Venkatachalam
Author_Institution
Dept. of Electr. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
fYear
1988
fDate
7-9 Dec 1988
Firstpage
1198
Abstract
The authors consider a controlled Markov chain whose transition probabilities and initial distribution are parameterized by an unknown parameter θ to some known parameter space Θ. There is a one-step reward associated with each pair of control and following state of the process. The objective is to maximize the expected value of the sum of one-step rewards over an infinite horizon. By introducing the loss associated with a control scheme, the authors show that the problem is equivalent to minimizing the loss. They define uniformly good adaptive control schemes and restrict attention to these schemes. They develop a lower bound on the loss associated with any uniformly good control scheme. Finally, they construct an adaptive control scheme whose loss equals the lower bound and is therefore optimal
Keywords
Markov processes; adaptive control; probability; adaptive allocation schemes; adaptive control; controlled Markov chains; infinite horizon; initial distribution; one-step reward; transition probabilities; Adaptive control; Adaptive signal processing; Character generation; Communication system control; Infinite horizon; Laboratories; Optimal control; Programmable control; Stochastic systems; Weight control;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 1988., Proceedings of the 27th IEEE Conference on
Conference_Location
Austin, TX
Type
conf
DOI
10.1109/CDC.1988.194511
Filename
194511
Link To Document