DocumentCode
3082349
Title
Adaptive control of Markov chains under the weak accessibility
Author
Agrawal, Rajeev
Author_Institution
Dept. of Electr. & Comput. Eng., Wisconsin Univ., Madison, WI, USA
fYear
1990
fDate
5-7 Dec 1990
Firstpage
1426
Abstract
The author considers the adaptive control of Markov chains under the weak accessibility condition with the objective of minimizing the learning loss. First, it is shown that, by using a stationary randomized control scheme, the maximum likelihood estimate of the unknown parameter converges exponentially fast to its true value. Then a certainty equivalence control with a forcing type scheme is constructed with alternative phases of forcing and certainty equivalence control. The stationary randomized control scheme for forcing is used in such a way that by cutting and pasting the resulting observations a single Markov chain is obtained. This in turn allows the rate of forcing to be chosen appropriately, giving a learning loss of O (f (n )log n ) for any function f (n )→∞ as n →∞
Keywords
Markov processes; adaptive control; probability; stochastic systems; Markov chains; adaptive control; certainty equivalence control; convergence; forcing; learning loss minimization; maximum likelihood estimate; stationary randomized control scheme; weak accessibility; Adaptive control; Arm; Costs; Optimal control; Parameter estimation; State-space methods; Stochastic processes; Stochastic systems; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 1990., Proceedings of the 29th IEEE Conference on
Conference_Location
Honolulu, HI
Type
conf
DOI
10.1109/CDC.1990.203847
Filename
203847
Link To Document