DocumentCode
2475719
Title
Approximate Dynamic Programming Based on Expansive Projections
Author
Arruda, Edilson F. ; Val, João B R do
Author_Institution
Center for Syst. & Control, National Lab. for Sci. Comput., Petropolis
fYear
2006
fDate
13-15 Dec. 2006
Firstpage
5537
Lastpage
5542
Abstract
We present a general method to obtain convergent approximate value iteration algorithms with function approximation. The result is applicable to any arbitrary approximation architecture and generalizes existing results in the literature derived for particular approximation schemes. Additionally, we show how to obtain a convergent approximate mapping whose fixed point is the projection in the approximation space of a fixed point of the exact dynamic programming mapping with regards to a suitable subset norm. This result relies on evaluating the difference between successive iterates in the selected subset norm, which provides convergent procedures for any arbitrary approximation architecture
Keywords
approximation theory; convergence; dynamic programming; function approximation; iterative methods; approximate dynamic programming; convergent approximate value iteration algorithms; expansive projections; function approximation; Approximation algorithms; Computer architecture; Convergence; Dynamic programming; Function approximation; Heuristic algorithms; Large-scale systems; Monitoring; State-space methods; USA Councils;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 2006 45th IEEE Conference on
Conference_Location
San Diego, CA
Print_ISBN
1-4244-0171-2
Type
conf
DOI
10.1109/CDC.2006.376823
Filename
4177621
Link To Document