• DocumentCode
    2394182
  • Title

    Robust Markov Decision Processes using Sigma Point sampling

  • Author

    Bertuccelli, L.F. ; How, J.P.

  • Author_Institution
    Dept. of Aeronaut. & Astronaut., MIT, Cambridge, MA
  • fYear
    2008
  • fDate
    11-13 June 2008
  • Firstpage
    5003
  • Lastpage
    5008
  • Abstract
    This paper presents a new robust decision making algorithm that accounts for model uncertainty in finite state/action, Markov Decision Processes (MDPs). In particular we generate robust and optimal control policies using Sigma Point sampling methods for dynamic multi-stage problems where the probabilistic transition model of the MDP may be fixed, but uncertain. In the case of poorly known transition model governing a MDP, this paper shows that the total number of scenarios in a scenario-based robust optimization may be decreased by generating a small number of appropriately chosen samples of the model. The robust policy for the worst- case instance of the data can be approximated by identifying the minimum objective function obtained from these realizations. This paper compares the proposed approach to more direct sampling-based approaches in a machine repair problem. The numerical examples show reduction in the total number of simulations required to obtain robust solutions while achieving optimal results.
  • Keywords
    Markov processes; decision making; optimal control; robust control; sampling methods; machine repair problem; optimal control policies; robust Markov decision processes; robust control policies; robust decision-making algorithm; scenario-based robust optimization; sigma point sampling methods; Analysis of variance; Computational modeling; Cost function; Decision making; Infinite horizon; Optimal control; Robust control; Robustness; Sampling methods; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    American Control Conference, 2008
  • Conference_Location
    Seattle, WA
  • ISSN
    0743-1619
  • Print_ISBN
    978-1-4244-2078-0
  • Electronic_ISBN
    0743-1619
  • Type

    conf

  • DOI
    10.1109/ACC.2008.4587287
  • Filename
    4587287