Title : 
Risk-Constrained Markov Decision Processes
         
        
            Author : 
Borkar, Vivek ; Jain, R.
         
        
            Author_Institution : 
Dept. of Electr. Eng., Indian Inst. of Technol., Mumbai, Mumbai, India
         
        
        
        
        
        
        
        
            Abstract : 
We propose a new constrained Markov decision process framework with risk-type constraints. The risk metric we use is Conditional Value-at-Risk (CVaR), which is gaining popularity in finance. It is a conditional expectation but the conditioning is defined in terms of the level of the tail probability. We propose an iterative offline algorithm to find the risk-contrained optimal control policy. A two time-scale stochastic approximation-inspired `learning´ variant is also sketched, and its convergence proved to the optimal risk-constrained policy.
         
        
            Keywords : 
Markov processes; approximation theory; iterative methods; CVaR risk metric; conditional expectation; conditional value-at-risk; iterative offline algorithm; optimal risk-constrained policy; risk-constrained Markov decision process; risk-constrained optimal control; risk-type constraints; tail probability; time-scale stochastic approximation-inspired learning variant; Aerospace electronics; Approximation methods; Convergence; Markov processes; Optimization; Yttrium; Constrained Markov decision processes; risk measures; stochastic approximations;
         
        
        
            Journal_Title : 
Automatic Control, IEEE Transactions on
         
        
        
        
        
            DOI : 
10.1109/TAC.2014.2309262