Computationally efficient adaptive control algorithms for Markov chains

Author

Jalali, A. ; Ferguson, M.

Author_Institution

Dept. of Electr. Eng., McGill Univ., Montreal, Que., Canada

fYear

1989

fDate

13-15 Dec 1989

Firstpage

1283

Abstract

Algorithms for adaptive control of unknown finite Markov chains are proposed. The algorithms consist of two parts: part one estimates the unknown parameters; part two computes the optimal policy. In this study the emphasis is on efficient online computation of the optimal policy. No a priori knowledge of the optimal policy is assumed. The optimal policy is computed recursively online. At each step a small amount of computation is required. At each transition of the chain, only the act corresponding to the present state of the chain is updated. The algorithms are easy to implement and converge to the optimal policy in finite time

Keywords

Markov processes; adaptive control; parameter estimation; Markov chains; Markov processes; adaptive control algorithms; online computation; optimal policy; parameter estimation; Adaptive control; Automatic control; Convergence; Cost function; Heuristic algorithms; Learning automata; Maximum likelihood estimation; Optimal control; Parameter estimation; State-space methods;

fLanguage

English

Publisher

ieee

Conference_Titel

Decision and Control, 1989., Proceedings of the 28th IEEE Conference on

Conference_Location

Tampa, FL

Type

conf

DOI

10.1109/CDC.1989.70344

Filename

70344

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3482991