DocumentCode :
1341101
Title :
Approximating Ergodic Average Reward Continuous-Time Controlled Markov Chains
Author :
Prieto-Rumeau, Tomás ; Lorenzo, José María
Author_Institution :
Dept. of Stat. & Oper. Res., UNED, Madrid, Spain
Volume :
55
Issue :
1
fYear :
2010
Firstpage :
201
Lastpage :
207
Abstract :
We study the approximation of an ergodic average reward continuous-time denumerable state Markov decision process (MDP) by means of a sequence of MDPs. Our results include the convergence of the corresponding optimal policies and the optimal gains. For a controlled upwardly skip-free process, we show some computational results to illustrate the convergence theorems.
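The keywords below mention the policy iteration algorithm used on the approximating MDPs. As a minimal sketch (hypothetical illustration, not the authors' code), policy iteration for an ergodic average-reward continuous-time MDP on a finite truncated state space alternates between solving the gain/bias equations for the current policy and greedily improving it:

```python
# Hypothetical sketch (not the authors' implementation): policy iteration for
# an ergodic average-reward continuous-time MDP on a finite (truncated) state
# space, of the kind used to approximate a denumerable-state CTMDP.
import numpy as np

def policy_iteration(rates, rewards, max_iter=100):
    """rates[a] is the generator matrix under action a (rows sum to 0);
    rewards[a][i] is the reward rate in state i under action a.
    Returns the optimal average gain g* and an optimal stationary policy."""
    n_actions = len(rates)
    n_states = rates[0].shape[0]
    policy = np.zeros(n_states, dtype=int)
    for _ in range(max_iter):
        # Policy evaluation: solve r_f - g*1 + Q_f h = 0 with h[0] = 0,
        # i.e. g*1 - Q_f h = r_f, unknowns (g, h[1], ..., h[n-1]).
        Q = np.array([rates[policy[i]][i] for i in range(n_states)])
        r = np.array([rewards[policy[i]][i] for i in range(n_states)])
        A = np.zeros((n_states, n_states))
        A[:, 0] = 1.0          # coefficient of the gain g
        A[:, 1:] = -Q[:, 1:]   # coefficients of the bias terms h[1:]
        x = np.linalg.solve(A, r)
        g, h = x[0], np.concatenate(([0.0], x[1:]))
        # Policy improvement: pick a maximizing r(i,a) + sum_j q(j|i,a) h[j].
        new_policy = np.array([
            max(range(n_actions),
                key=lambda a: rewards[a][i] + rates[a][i] @ h)
            for i in range(n_states)
        ])
        if np.array_equal(new_policy, policy):
            return g, policy   # policy is average-reward optimal
        policy = new_policy
    return g, policy

# Toy two-state example (illustrative data, not from the paper): action 1 is
# dominated by action 0, so the optimal gain equals the stationary average
# reward of action 0's chain, pi = (2/3, 1/3), giving g* = 2.
rates = [np.array([[-1.0, 1.0], [2.0, -2.0]]),
         np.array([[-1.0, 1.0], [2.0, -2.0]])]
rewards = [np.array([3.0, 0.0]), np.array([2.0, 0.0])]
g, pol = policy_iteration(rates, rewards)
```

The truncation-based approximation in the paper would apply such a routine to a growing sequence of finite state spaces and study convergence of the resulting gains and policies.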
Keywords :
Markov processes; continuous time systems; convergence; optimal control; Markov decision process; convergence theorems; ergodic average reward continuous-time control; optimal gains; optimal policies; Adaptive control; Convergence; Operations research; Optimal control; Process control; State-space methods; Statistics; Terminology; Approximation of control problems; Ergodic Markov decision processes (MDPs); policy iteration algorithm;
fLanguage :
English
Journal_Title :
IEEE Transactions on Automatic Control
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/TAC.2009.2033848
Filename :
5340606