Regional Information Center for Science and Technology - A Basic Formula for Online Policy Gradient Algorithms

Title of article :

A Basic Formula for Online Policy Gradient Algorithms

Author/Authors :

X.-R. Cao (author)

Issue Information :

Serial-numbered journal, year 2005

Pages :

From page :

696

To page :

699

Keywords :

perturbation analysis (PA), reinforcement learning, online estimation, Markov decision processes, potentials, Poisson equations, perturbation realization

Journal title :

IEEE Transactions on Automatic Control

Serial Year :

2005

Record number :

386680

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=386680