DocumentCode
3783457
Title
Asymptotic analysis of stochastic approximation algorithms under violated Kushner-Clark conditions with applications
Author
V. Tadic
Author_Institution
Dept. of Electr. & Electron. Eng., Melbourne Univ., Parkville, Vic., Australia
Volume
3
fYear
2000
Firstpage
2875
Abstract
Motivated by the problem of the asymptotic behavior of temporal-difference learning algorithms with nonlinear function approximation, the local almost sure asymptotic properties of stochastic approximation algorithms are analyzed for violated Kushner-Clark conditions (1978). First, the algorithms with additive noise are analyzed for the case where the noise is state-dependent. The obtained results are then applied to the analysis of the algorithms with nonadditive noise. Using these general results, the analysis of temporal-difference learning algorithms is carried out for the case of a general nonlinear function approximation and under the assumptions allowing the underlying Markov chain to be positive Harris. The general results are also illustrated by an example where the noise is nonadditive, correlated and satisfies strong mixing conditions.
Keywords
"Algorithm design and analysis","Stochastic processes","Approximation algorithms","Function approximation","Stochastic resonance","Additive noise","Convergence","Learning","Stability analysis","Ear"
Publisher
ieee
Conference_Titel
Decision and Control, 2000. Proceedings of the 39th IEEE Conference on
ISSN
0191-2216
Print_ISBN
0-7803-6638-7
Type
conf
DOI
10.1109/CDC.2000.914246
Filename
914246
Link To Document