• DocumentCode
    2475719
  • Title

    Approximate Dynamic Programming Based on Expansive Projections

  • Author

    Arruda, Edilson F. ; Val, João B R do

  • Author_Institution
    Center for Syst. & Control, National Lab. for Sci. Comput., Petropolis
  • fYear
    2006
  • fDate
    13-15 Dec. 2006
  • Firstpage
    5537
  • Lastpage
    5542
  • Abstract
    We present a general method to obtain convergent approximate value iteration algorithms with function approximation. The result is applicable to any arbitrary approximation architecture and generalizes existing results in the literature derived for particular approximation schemes. Additionally, we show how to obtain a convergent approximate mapping whose fixed point is the projection in the approximation space of a fixed point of the exact dynamic programming mapping with regards to a suitable subset norm. This result relies on evaluating the difference between successive iterates in the selected subset norm, which provides convergent procedures for any arbitrary approximation architecture
  • Keywords
    approximation theory; convergence; dynamic programming; function approximation; iterative methods; approximate dynamic programming; convergent approximate value iteration algorithms; expansive projections; function approximation; Approximation algorithms; Computer architecture; Convergence; Dynamic programming; Function approximation; Heuristic algorithms; Large-scale systems; Monitoring; State-space methods; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2006 45th IEEE Conference on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    1-4244-0171-2
  • Type

    conf

  • DOI
    10.1109/CDC.2006.376823
  • Filename
    4177621