Title of article :
Dynamically identifying regenerative cycles in Simulation-based Optimization Algorithms for Markov chains
Author/Authors :
E.، Campos-Nanez, نويسنده , , S.D.، Patek, نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2004
Pages :
4
From page :
1022
To page :
1025
Abstract :
Simulation-based algorithms for maximizing the average reward of a parameterized Markov chain often rely upon the existence of a state which is recurrent for all choices of parameter values. The question of which recurrent state should serve to mark the end of a regenerative cycle is a very important practical consideration in applications. Even when all of the states of the process are recurrent, some states tend to be visited more often than others, and lengthy renewal cycles tend to result in high variance estimates of the gradient. To address this difficulty, we analyze a mechanism for adjusting this special state dynamically (i/sup */-adaptation) as applied to the "batch" simulation-based optimization algorithm of a previous paper. We show that the desirable convergence properties of the original "batch" algorithm are retained with i/sup */-adaptation, namely the almost sure convergence of the parameter vector to a critical point.
Keywords :
Hydrograph
Journal title :
IEEE Transactions on Automatic Control
Serial Year :
2004
Journal title :
IEEE Transactions on Automatic Control
Record number :
97812
Link To Document :
بازگشت