Title :
A relative value iteration for controlled diffusions under ergodic cost
Author :
Arapostathis, Ari ; Borkar, Vivek S.
Author_Institution :
Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX, USA
Abstract :
The ergodic control problem for a non-degenerate diffusion controlled through its drift is considered under a uniform stability condition that ensures the well-posedness of the associated Hamilton-Jacobi-Bellman (HJB) equation. A nonlinear parabolic evolution equation is then proposed as a continuous time continuous state space analog of White´s `relative value iteration´ algorithm for solving the ergodic dynamic programming equation for the finite state finite action case. Its convergence to the solution of the HJB equation is established using the theory of monotone dynamical systems and also, alternatively, by using the theory of reverse martingales.
Keywords :
dynamic programming; nonlinear equations; stability; HJB equation; Hamilton-Jacobi-Bellman equation; continuous time continuous state space analog; controlled diffusions; ergodic control problem; ergodic cost; ergodic dynamic programming equation; finite state finite action case; monotone dynamical systems; nondegenerate diffusion; nonlinear parabolic evolution equation; relative value iteration algorithm; reverse martingales; uniform stability condition; Aerospace electronics; Convergence; Equations; Extraterrestrial measurements; Markov processes; Process control; Standards;
Conference_Titel :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
Print_ISBN :
978-1-4673-2065-8
Electronic_ISBN :
0743-1546
DOI :
10.1109/CDC.2012.6426016