مرکز منطقه ای اطلاع رساني علوم و فناوري - Asymptotic analysis of temporal-difference learning algorithms with linear function approximation

DocumentCode :

3782767

Title :

Asymptotic analysis of temporal-difference learning algorithms with linear function approximation

Author :

V. Tadic

Author_Institution :

Mihajlo Pupin Inst., Belgrade, Serbia

Volume :

fYear :

1999

Firstpage :

5050

Abstract :

The asymptotic properties of temporal-difference learning algorithms with linear function approximation are analyzed in the paper. The analysis is carried out in the context of the approximation of a discounted cost-to-go function associated to an uncontrolled Markov chain with an uncountable finite-dimensional state-space.

Keywords :

"Algorithm design and analysis","Approximation algorithms","Function approximation","Convergence","Difference equations","Random variables"

Publisher :

ieee

Conference_Titel :

Decision and Control, 1999. Proceedings of the 38th IEEE Conference on

ISSN :

0191-2216

Print_ISBN :

0-7803-5250-5

Type :

conf

DOI :

10.1109/CDC.1999.833350

Filename :

833350

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3782767