مرکز منطقه ای اطلاع رساني علوم و فناوري - Adaptive control of Markov chains under the weak accessibility

DocumentCode :

3082349

Title :

Adaptive control of Markov chains under the weak accessibility

Author :

Agrawal, Rajeev

Author_Institution :

Dept. of Electr. & Comput. Eng., Wisconsin Univ., Madison, WI, USA

fYear :

1990

fDate :

5-7 Dec 1990

Firstpage :

1426

Abstract :

The author considers the adaptive control of Markov chains under the weak accessibility condition with the objective of minimizing the learning loss. First, it is shown that, by using a stationary randomized control scheme, the maximum likelihood estimate of the unknown parameter converges exponentially fast to its true value. Then a certainty equivalence control with a forcing type scheme is constructed with alternative phases of forcing and certainty equivalence control. The stationary randomized control scheme for forcing is used in such a way that by cutting and pasting the resulting observations a single Markov chain is obtained. This in turn allows the rate of forcing to be chosen appropriately, giving a learning loss of O(f(n)log n) for any function f (n)→∞ as n→∞

Keywords :

Markov processes; adaptive control; probability; stochastic systems; Markov chains; adaptive control; certainty equivalence control; convergence; forcing; learning loss minimization; maximum likelihood estimate; stationary randomized control scheme; weak accessibility; Adaptive control; Arm; Costs; Optimal control; Parameter estimation; State-space methods; Stochastic processes; Stochastic systems; Uncertainty;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 1990., Proceedings of the 29th IEEE Conference on

Conference_Location :

Honolulu, HI

Type :

conf

DOI :

10.1109/CDC.1990.203847

Filename :

203847

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3082349