مرکز منطقه ای اطلاع رساني علوم و فناوري - Mean-Variance Criteria for Finite Continuous-Time Markov Decision Processes

DocumentCode :

1300835

Title :

Mean-Variance Criteria for Finite Continuous-Time Markov Decision Processes

Author :

Guo, Xianping ; Song, Xinyuan

Author_Institution :

Sch. of Math. & Comput. Sci., Zhongshan Univ., Guangzhou, China

Volume :

Issue :

fYear :

2009

Firstpage :

2151

Lastpage :

2157

Abstract :

This technical note deals with the mean variance problem (known as the average variance (AV) minimization problem) for finite continuous time Markov decision processes. We first introduce a so called G-condition which is weaker than the well known ergodicity and unichain conditions and sufficient for the finiteness of the AV of a policy. Also, we present an example of a policy having infinite AV when the G-condition is not satisfied. Under the G-condition we prove that the AV criterion can be transformed into an equivalent mean (or expected) average criterion by using a martingale technique and an observation from the canonical form of a transition rate matrix, and thus the existence and calculation of an AV minimal policy over a class of mean optimal policies are obtained by a policy iteration algorithm in an finite number of iterations. As byproduct, we obtain some interesting new results about the mean average optimality.

Keywords :

Markov processes; continuous time systems; iterative methods; G-condition; average variance minimization problem; canonical form; equivalent mean average criterion; ergodicity condition; finite continuous time Markov decision process; martingale technique; mean variance criteria; policy iteration algorithm; transition rate matrix; unichain condition; Councils; Mathematics; Minimization methods; Portfolios; Statistics; Terminology; AMS (2000): 90C40, 93E20; G-condition; finite continuous-time Markov decision process (CTMDP); mean-variance; policy iteration algorithm; variance minimization policy;

fLanguage :

English

Journal_Title :

Automatic Control, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9286

Type :

jour

DOI :

10.1109/TAC.2009.2023833

Filename :

5208190

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1300835