DocumentCode :
427514
Title :
Strict-sense constrained Markov decision processes
Author :
Hsu, Shun-Pin ; Arapostathis, Ari
Author_Institution :
Dept. of Electr. Eng., Nat. Chi-Nan Univ., Nantou
Volume :
1
fYear :
0
fDate :
0-0 0
Firstpage :
194
Abstract :
We introduce the strict-sense constrained Markov decision processes by extending the ideas of classical constrained Markov decision process and the safety control of discrete event systems. In our setting a convex set of constraint specified by a set of linear inequalities is given for the system´s state probability distribution. A distribution is safe if it is in the set of constraint. A policy is safe if it makes a state distribution that is initially safe remain safe after the process starts. Under the assumption of complete state observation, we first identify the optimal safe policy that minimizes the pre-specified cost function. Then, for a given safe policy, we provide an iterative algorithm for computing the maximum set of safe initial distributions corresponding to the policy. Under the assumption that the policy induces a unique limiting distribution in the interior of set of constraint, we give an explicit upper bound on the number of steps needed for the termination of the algorithm. In particular, we give the explicit expression for the maximum set of safe initial distributions in the two-state system, for which we show that at most one iteration is needed in running the algorithm
Keywords :
Markov processes; discrete event systems; iterative methods; linear matrix inequalities; safety systems; statistical distributions; Markov decision processes; discrete event system; iterative algorithm; linear inequalities; safety control; state observation; state probability distribution; strict-sense constrained; Automatic control; Control systems; Cost function; Discrete event systems; Dynamic programming; Electrical safety; Equations; Iterative algorithms; Probability distribution; Upper bound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2004 IEEE International Conference on
Conference_Location :
The Hague
ISSN :
1062-922X
Print_ISBN :
0-7803-8566-7
Type :
conf
DOI :
10.1109/ICSMC.2004.1398296
Filename :
1398296
Link To Document :
بازگشت