Robust Markov Decision Processes using Sigma Point sampling

Author

Bertuccelli, L.F. ; How, J.P.

Author_Institution

Dept. of Aeronaut. & Astronaut., MIT, Cambridge, MA

fYear

2008

fDate

11-13 June 2008

Firstpage

5003

Lastpage

5008

Abstract

This paper presents a new robust decision making algorithm that accounts for model uncertainty in finite state/action, Markov Decision Processes (MDPs). In particular we generate robust and optimal control policies using Sigma Point sampling methods for dynamic multi-stage problems where the probabilistic transition model of the MDP may be fixed, but uncertain. In the case of poorly known transition model governing a MDP, this paper shows that the total number of scenarios in a scenario-based robust optimization may be decreased by generating a small number of appropriately chosen samples of the model. The robust policy for the worst- case instance of the data can be approximated by identifying the minimum objective function obtained from these realizations. This paper compares the proposed approach to more direct sampling-based approaches in a machine repair problem. The numerical examples show reduction in the total number of simulations required to obtain robust solutions while achieving optimal results.

Keywords

Markov processes; decision making; optimal control; robust control; sampling methods; machine repair problem; optimal control policies; robust Markov decision processes; robust control policies; robust decision-making algorithm; scenario-based robust optimization; sigma point sampling methods; Analysis of variance; Computational modeling; Cost function; Decision making; Infinite horizon; Optimal control; Robust control; Robustness; Sampling methods; Uncertainty;

fLanguage

English

Publisher

ieee

Conference_Titel

American Control Conference, 2008

Conference_Location

Seattle, WA

ISSN

0743-1619

Print_ISBN

978-1-4244-2078-0

Electronic_ISBN

0743-1619

Type

conf

DOI

10.1109/ACC.2008.4587287

Filename

4587287