• DocumentCode
    14352
  • Title

    Risk-Constrained Markov Decision Processes

  • Author

    Borkar, Vivek ; Jain, R.

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Technol., Mumbai, Mumbai, India
  • Volume
    59
  • Issue
    9
  • fYear
    2014
  • fDate
    Sept. 2014
  • Firstpage
    2574
  • Lastpage
    2579
  • Abstract
    We propose a new constrained Markov decision process framework with risk-type constraints. The risk metric we use is Conditional Value-at-Risk (CVaR), which is gaining popularity in finance. It is a conditional expectation but the conditioning is defined in terms of the level of the tail probability. We propose an iterative offline algorithm to find the risk-contrained optimal control policy. A two time-scale stochastic approximation-inspired `learning´ variant is also sketched, and its convergence proved to the optimal risk-constrained policy.
  • Keywords
    Markov processes; approximation theory; iterative methods; CVaR risk metric; conditional expectation; conditional value-at-risk; iterative offline algorithm; optimal risk-constrained policy; risk-constrained Markov decision process; risk-constrained optimal control; risk-type constraints; tail probability; time-scale stochastic approximation-inspired learning variant; Aerospace electronics; Approximation methods; Convergence; Markov processes; Optimization; Yttrium; Constrained Markov decision processes; risk measures; stochastic approximations;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/TAC.2014.2309262
  • Filename
    6750726