مرکز منطقه ای اطلاع رساني علوم و فناوري - Simulation-based optimization of Markov reward processes: implementation issues

DocumentCode :

1806076

Title :

Simulation-based optimization of Markov reward processes: implementation issues

Author :

Marbach, Peter ; Tsitsiklis, John N.

Author_Institution :

Lab. for Inf. & Decision Syst., MIT, Cambridge, MA, USA

Volume :

fYear :

1999

fDate :

1999

Firstpage :

1769

Abstract :

We consider discrete time, finite state space Markov reward processes which depend on a set of parameters. Previously, we proposed a simulation-based methodology to tune the parameters to optimize the average reward. The resulting algorithms converge with probability 1, but may have a high variance. Here, we propose two approaches to reduce the variance, which however introduce a new bias into the update direction. We report numerical results which indicate that the resulting algorithms are robust with respect to a small bias

Keywords :

Markov processes; decision theory; optimisation; probability; state-space methods; Markov reward processes; average reward; discrete time systems; optimization; probability; simulation; state space method; Analytical models; Approximation algorithms; Contracts; Convergence; Optimization methods; Robustness; State estimation; State-space methods; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 1999. Proceedings of the 38th IEEE Conference on

Conference_Location :

Phoenix, AZ

ISSN :

0191-2216

Print_ISBN :

0-7803-5250-5

Type :

conf

DOI :

10.1109/CDC.1999.830889

Filename :

830889

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1806076