Title :
Analysis of an adaptive control scheme for a partially observed controlled Markov chain
Author :
Fernandez-Gaucherand, E. ; Arapostathis, Aristotle ; Marcus, Steven I.
Author_Institution :
Dept. of Syst. & Ind. Eng., Arizona Univ., Tucson, AZ, USA
fDate :
6/1/1993 12:00:00 AM
Abstract :
The authors consider an adaptive finite state controlled Markov chain with partial state information, motivated by a class of replacement problems. They present parameter estimation techniques based on the information available after actions that reset the state to a known value are taken. It is proved that the parameter estimates converge w.p.1 to the true (unknown) parameter, under the feedback structure induced by a certainty equivalent adaptive policy. It is shown that the adaptive policy is self-optimizing in a long-run average sense, for any (measurable) sequence of parameter estimates converging w.p.1 to the true parameter
Keywords :
Markov processes; adaptive control; control system analysis; differential equations; parameter estimation; adaptive control; feedback structure; parameter estimation; partially observed controlled Markov chain; w.p.1; Adaptive control; Adaptive estimation; Convergence; Feedback; Filters; Information analysis; Parameter estimation; Programmable control; Recursive estimation; State estimation;
Journal_Title :
Automatic Control, IEEE Transactions on