مرکز منطقه ای اطلاع رساني علوم و فناوري - On optimal control of Markov chains with safety constraint

DocumentCode :

2294142

Title :

On optimal control of Markov chains with safety constraint

Author :

Hsu, Shun-Pin ; Arapostathis, Ari ; Kumar, Ratnesh

Author_Institution :

National Chi-Nan Univ. Electr. Eng., Nantou

fYear :

2006

fDate :

14-16 June 2006

Abstract :

We study the control of completely observed Markov chains with safety bounds as introduced by Arapostathis et al (2005), but with more general safety constraints and the added requirement of optimality. The safety bounds were specified as unit-interval valued vector pairs (lower and upper bounds for each component of the state probability distribution). In this paper we generalize the constraint set to be any linear convex set and present a way to compute a stationary control policy which is safe (i.e., maintains the safety of the distribution that is initially safe) and at the same time it is long-run average optimal. We propose a linear programming formulation for computing such a safe optimal policy. Under a simplifying assumption that the optimal policy is ergodic, we present a finitely-terminating iterative algorithm to compute the maximal invariant safe set (MISS) where the initial distribution must lie so that the future distributions always remain safe. Our approach allows us to calculate an upper bound for the number of iterations needed for the algorithm to terminate. In particular, for the two-state chains we show that at most one iteration is needed to compute the MISS

Keywords :

Markov processes; discrete event systems; iterative methods; linear programming; optimal control; statistical distributions; stochastic systems; Markov chains; finitely-terminating iterative algorithm; linear convex set; linear programming; maximal invariant safe set; optimal control; reliability; safety bounds; safety constraint; safety control; state probability distribution; stationary control policy; stochastic discrete event system; unit-interval valued vector pairs; Automatic control; Control systems; Distributed computing; Dynamic programming; Electrical safety; Equations; Linear programming; Optimal control; Probability distribution; Upper bound;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

American Control Conference, 2006

Conference_Location :

Minneapolis, MN

Print_ISBN :

1-4244-0209-3

Electronic_ISBN :

1-4244-0209-3

Type :

conf

DOI :

10.1109/ACC.2006.1657431

Filename :

1657431

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2294142