Title :
Analysis of an adaptive control scheme for a partially observed controlled Markov chain
Author :
Arapostathis, Aristotle ; Fernandez-Gaucherand, E. ; Marcus, Steven I.
Author_Institution :
Dept. of Electr. & Comput. Eng., Texas Univ., Austin, TX, USA
Abstract :
The authors present parameter estimation techniques based on the information available after actions that reset the state to a known value are taken. At these times, the (augmented) state process `regenerates,´ its future evolution becoming independent of the past. Using the ordinary differential equation method, w.p.1 convergence of the parameter estimates to the true (unknown) parameter θ0 is demonstrated for two parameter estimation schemes of this type. Then, given any sequence of parameter estimates which is measurable with respect to the filtration generated by the observations and converges w.p.1 to θ0, the expected long-run average cost incurred by the certainty equivalent adaptive policy is the same as that incurred by the optimal stationary policy corresponding to the true parameter. The latter is obtained by a detailed analysis which uses the known parameters
Keywords :
Markov processes; adaptive control; parameter estimation; stochastic systems; adaptive control scheme; augmented state process regeneration; convergence; expected long-run average cost; ordinary differential equation; parameter estimation; partially observed controlled Markov chain; Adaptive control; Communication networks; Communication system control; Linear systems; Maximum likelihood detection; Nonlinear filters; Parameter estimation; Recursive estimation; State estimation; Stochastic processes;
Conference_Titel :
Decision and Control, 1990., Proceedings of the 29th IEEE Conference on
Conference_Location :
Honolulu, HI
DOI :
10.1109/CDC.1990.203849