DocumentCode :
839507
Title :
Optimal adaptive controllers for unknown Markov chains
Author :
Kumar, P.R. ; Lin, Woei
Author_Institution :
University of Maryland, Baltimore, MD, USA
Volume :
27
Issue :
4
fYear :
1982
fDate :
8/1/1982 12:00:00 AM
Firstpage :
765
Lastpage :
774
Abstract :
We consider the problem of adaptively controlling an unknown Markov chain. No prior information regarding the values of the transition probabilities is provided us (except for a list of forbidden, zero-probability transitions, which is usually obtained as a byproduct of the modeling process itself). The goal is to design an adaptive controller to adequately control the unknown system when its performance is measured by the average cost incurred over a long operating time period. Our main result is the exhibition of a family of adaptive controllers which, when applied to the unknown system, will result in a performance precisely equal to the optimal performance attainable if the system, i.e., the transition probabilities, were known. Hence, the adaptive controllers proposed here are truly optimal, even when operating on an unknown system. The results presented here extend similar results in [1] where we assume to be initially provided with a finite set of possible models, one of which is guaranteed to be the true one. This paper directly addresses those practical situations where a finite set of possible models with such a guarantee is hard to come by.
Keywords :
Adaptive control; Markov processes; Optimal stochastic control; Stochastic optimal control; Uncertain systems; Adaptive control; Control systems; Cost function; Digital control; Learning systems; Mathematics; Optimal control; Pattern recognition; Programmable control; Time measurement;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/TAC.1982.1103017
Filename :
1103017
Link To Document :
بازگشت