DocumentCode :
3783457
Title :
Asymptotic analysis of stochastic approximation algorithms under violated Kushner-Clark conditions with applications
Author :
V. Tadic
Author_Institution :
Dept. of Electr. & Electron. Eng., Melbourne Univ., Parkville, Vic., Australia
Volume :
3
fYear :
2000
Firstpage :
2875
Abstract :
Motivated by the problem of the asymptotic behavior of temporal-difference learning algorithms with nonlinear function approximation, the local almost sure asymptotic properties of stochastic approximation algorithms are analyzed for violated Kushner-Clark conditions (1978). First, the algorithms with additive noise are analyzed for the case where the noise is state-dependent. The obtained results are then applied to the analysis of the algorithms with nonadditive noise. Using these general results, the analysis of temporal-difference learning algorithms is carried out for the case of a general nonlinear function approximation and under the assumptions allowing the underlying Markov chain to be positive Harris. The general results are also illustrated by an example where the noise is nonadditive, correlated and satisfies strong mixing conditions.
Keywords :
"Algorithm design and analysis","Stochastic processes","Approximation algorithms","Function approximation","Stochastic resonance","Additive noise","Convergence","Learning","Stability analysis","Ear"
Publisher :
ieee
Conference_Titel :
Decision and Control, 2000. Proceedings of the 39th IEEE Conference on
ISSN :
0191-2216
Print_ISBN :
0-7803-6638-7
Type :
conf
DOI :
10.1109/CDC.2000.914246
Filename :
914246
Link To Document :
بازگشت