DocumentCode :
3424612
Title :
Singular control for discounted Markov Decision Processes in a general state space
Author :
Costa, O.L.V. ; Dufour, F.
Author_Institution :
Dept. de Eng. de Telecomun. e Controle, Escola Politec. da Univ. de Sao Paulo, Sao Paulo, Brazil
fYear :
2011
fDate :
12-15 Dec. 2011
Firstpage :
7087
Lastpage :
7092
Abstract :
This paper studies the asymptotic optimality of discrete-time Markov Decision Processes (MDP´s in short) with general state space and action space and having weak and strong interactions. By using a similar approach as developed in [1], the idea in this paper is to consider a MDP with general state and action spaces and to reduce the dimension of the state space by considering an averaged model. This formulation is often described by introducing a small parameter ∈ >; 0 in the definition of the transition kernel, leading to a singularly perturbed Markov model with two time scales. First it is shown that the value function of the control problem for the perturbed system converges to the value function of a limit averaged control problem as ∈ goes to zero. In the sequel it is shown that a feedback control policy for the original control problem defined by using an optimal feedback policy for the limit problem is asymptotically optimal.
Keywords :
Markov processes; discrete time systems; feedback; optimal control; state-space methods; action space; asymptotic optimality; discounted Markov decision processes; discrete-time processes; feedback control policy; general state space; limit averaged control problem; optimal feedback policy; original control problem; singular control; singularly perturbed Markov model; transition kernel; value function; Aerospace electronics; Convergence; Equations; Feedback control; Kernel; Markov processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on
Conference_Location :
Orlando, FL
ISSN :
0743-1546
Print_ISBN :
978-1-61284-800-6
Electronic_ISBN :
0743-1546
Type :
conf
DOI :
10.1109/CDC.2011.6160377
Filename :
6160377
Link To Document :
بازگشت