DocumentCode
3645983
Title
Asymptotic bias of stochastic gradient search
Author
Vladislav B. Tadić;A. Doucet
Author_Institution
Department of Mathematics, University of Bristol, BS8 1TW, United Kingdom
fYear
2011
Firstpage
722
Lastpage
727
Abstract
The asymptotic behavior of the stochastic gradient algorithm with a biased gradient estimator is analyzed. Relying on arguments based on differential geometry (Yomdin theorem and Lojasiewicz inequality), relatively tight bounds on the asymptotic bias of the iterates generated by such an algorithm are derived. The obtained results hold under mild and verifiable conditions and cover a broad class of complex stochastic gradient algorithms. Using these results, the asymptotic properties of the actor-critic reinforcement learning are studied.
Keywords
"Signal processing algorithms","Estimation","Approximation methods","Learning","Markov processes","Approximation algorithms"
Publisher
ieee
Conference_Titel
Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on
ISSN
0191-2216
Print_ISBN
978-1-61284-800-6
Type
conf
DOI
10.1109/CDC.2011.6160812
Filename
6160812
Link To Document