• DocumentCode
    3645983
  • Title

    Asymptotic bias of stochastic gradient search

  • Author

    Vladislav B. Tadić;A. Doucet

  • Author_Institution
    Department of Mathematics, University of Bristol, BS8 1TW, United Kingdom
  • fYear
    2011
  • Firstpage
    722
  • Lastpage
    727
  • Abstract
    The asymptotic behavior of the stochastic gradient algorithm with a biased gradient estimator is analyzed. Relying on arguments based on differential geometry (Yomdin theorem and Lojasiewicz inequality), relatively tight bounds on the asymptotic bias of the iterates generated by such an algorithm are derived. The obtained results hold under mild and verifiable conditions and cover a broad class of complex stochastic gradient algorithms. Using these results, the asymptotic properties of the actor-critic reinforcement learning are studied.
  • Keywords
    "Signal processing algorithms","Estimation","Approximation methods","Learning","Markov processes","Approximation algorithms"
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on
  • ISSN
    0191-2216
  • Print_ISBN
    978-1-61284-800-6
  • Type

    conf

  • DOI
    10.1109/CDC.2011.6160812
  • Filename
    6160812