DocumentCode
3482991
Title
Computationally efficient adaptive control algorithms for Markov chains
Author
Jalali, A. ; Ferguson, M.
Author_Institution
Dept. of Electr. Eng., McGill Univ., Montreal, Que., Canada
fYear
1989
fDate
13-15 Dec 1989
Firstpage
1283
Abstract
Algorithms for adaptive control of unknown finite Markov chains are proposed. The algorithms consist of two parts: part one estimates the unknown parameters; part two computes the optimal policy. In this study the emphasis is on efficient online computation of the optimal policy. No a priori knowledge of the optimal policy is assumed. The optimal policy is computed recursively online. At each step a small amount of computation is required. At each transition of the chain, only the act corresponding to the present state of the chain is updated. The algorithms are easy to implement and converge to the optimal policy in finite time
Keywords
Markov processes; adaptive control; parameter estimation; Markov chains; Markov processes; adaptive control algorithms; online computation; optimal policy; parameter estimation; Adaptive control; Automatic control; Convergence; Cost function; Heuristic algorithms; Learning automata; Maximum likelihood estimation; Optimal control; Parameter estimation; State-space methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 1989., Proceedings of the 28th IEEE Conference on
Conference_Location
Tampa, FL
Type
conf
DOI
10.1109/CDC.1989.70344
Filename
70344
Link To Document