DocumentCode :
14352
Title :
Risk-Constrained Markov Decision Processes
Author :
Borkar, Vivek ; Jain, R.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol., Mumbai, Mumbai, India
Volume :
59
Issue :
9
fYear :
2014
fDate :
Sept. 2014
Firstpage :
2574
Lastpage :
2579
Abstract :
We propose a new constrained Markov decision process framework with risk-type constraints. The risk metric we use is Conditional Value-at-Risk (CVaR), which is gaining popularity in finance. It is a conditional expectation but the conditioning is defined in terms of the level of the tail probability. We propose an iterative offline algorithm to find the risk-contrained optimal control policy. A two time-scale stochastic approximation-inspired `learning´ variant is also sketched, and its convergence proved to the optimal risk-constrained policy.
Keywords :
Markov processes; approximation theory; iterative methods; CVaR risk metric; conditional expectation; conditional value-at-risk; iterative offline algorithm; optimal risk-constrained policy; risk-constrained Markov decision process; risk-constrained optimal control; risk-type constraints; tail probability; time-scale stochastic approximation-inspired learning variant; Aerospace electronics; Approximation methods; Convergence; Markov processes; Optimization; Yttrium; Constrained Markov decision processes; risk measures; stochastic approximations;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/TAC.2014.2309262
Filename :
6750726
Link To Document :
بازگشت