Title :
Simulation-based optimization of Markov reward processes: implementation issues
Author :
Marbach, Peter ; Tsitsiklis, John N.
Author_Institution :
Lab. for Inf. & Decision Syst., MIT, Cambridge, MA, USA
Abstract :
We consider discrete time, finite state space Markov reward processes which depend on a set of parameters. Previously, we proposed a simulation-based methodology to tune the parameters to optimize the average reward. The resulting algorithms converge with probability 1, but may have a high variance. Here, we propose two approaches to reduce the variance, which however introduce a new bias into the update direction. We report numerical results which indicate that the resulting algorithms are robust with respect to a small bias
Keywords :
Markov processes; decision theory; optimisation; probability; state-space methods; Markov reward processes; average reward; discrete time systems; optimization; probability; simulation; state space method; Analytical models; Approximation algorithms; Contracts; Convergence; Optimization methods; Robustness; State estimation; State-space methods; Stochastic processes;
Conference_Titel :
Decision and Control, 1999. Proceedings of the 38th IEEE Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5250-5
DOI :
10.1109/CDC.1999.830889