Title :
Computationally efficient adaptive control algorithms for Markov chains
Author :
Jalali, A. ; Ferguson, M.
Author_Institution :
Dept. of Electr. Eng., McGill Univ., Montreal, Que., Canada
Abstract :
Algorithms for adaptive control of unknown finite Markov chains are proposed. The algorithms consist of two parts: part one estimates the unknown parameters; part two computes the optimal policy. In this study the emphasis is on efficient online computation of the optimal policy. No a priori knowledge of the optimal policy is assumed. The optimal policy is computed recursively online. At each step a small amount of computation is required. At each transition of the chain, only the act corresponding to the present state of the chain is updated. The algorithms are easy to implement and converge to the optimal policy in finite time
Keywords :
Markov processes; adaptive control; parameter estimation; Markov chains; Markov processes; adaptive control algorithms; online computation; optimal policy; parameter estimation; Adaptive control; Automatic control; Convergence; Cost function; Heuristic algorithms; Learning automata; Maximum likelihood estimation; Optimal control; Parameter estimation; State-space methods;
Conference_Titel :
Decision and Control, 1989., Proceedings of the 28th IEEE Conference on
Conference_Location :
Tampa, FL
DOI :
10.1109/CDC.1989.70344