Title of article :
A Basic Formula for Online Policy Gradient Algorithms
Author/Authors :
X.-R. Cao, Author
Issue Information :
Journal with serial number, year 2005
Pages :
4
From page :
696
To page :
699
Keywords :
perturbation analysis (PA), reinforcement learning, online estimation, Markov decision processes, potentials, Poisson equations, perturbation realization
Journal title :
IEEE Transactions on Automatic Control
Serial Year :
2005
Record number :
386680
Link To Document :
https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=386680