مرکز منطقه ای اطلاع رساني علوم و فناوري - Markov decision processes with constrained stopping times

DocumentCode :

1743874

Title :

Markov decision processes with constrained stopping times

Author :

Woriguchi, M. ; Kurano, M. ; Yasuda, M.

Author_Institution :

Div. of Math. Sci. & Phys., Chiba Univ., Japan

Volume :

fYear :

2000

fDate :

2000

Firstpage :

706

Abstract :

The optimization problem for a stopped Markov decision process is considered to be taken over stopping times τ constrained so that E_τ⩽α for some fixed α>0. We introduce the concept of a randomized stationary stopping time which is a mixed extension of the entry time of a stopping region and prove the existence of an optimal constrained pair of stationary policy and stopping time by utilizing a Lagrange multiplier approach. Also, applying the idea of the one-step look ahead (OLA) policy the optimal constrained pair is sought concretely. As an example, constrained Markov deteriorating system is explained

Keywords :

Markov processes; decision theory; optimisation; Lagrange multiplier approach; Markov decision processes; OLA policy; constrained Markov deteriorating system; constrained stopping times; entry time mixed extension; one-step look ahead policy; optimal constrained pair; randomized stationary stopping time; Constraint optimization; Convergence; Dynamic programming; Equations; Lagrangian functions; Probability distribution; State-space methods; Stochastic processes; Time factors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 2000. Proceedings of the 39th IEEE Conference on

Conference_Location :

Sydney, NSW

ISSN :

0191-2216

Print_ISBN :

0-7803-6638-7

Type :

conf

DOI :

10.1109/CDC.2000.912850

Filename :

912850

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1743874