• DocumentCode
    2391971
  • Title

    An application of convex optimization concepts to approximate dynamic programming

  • Author

    Arruda, Edilson F. ; Fragoso, Marcelo D. ; Val, João Bosco R Do

  • Author_Institution
    Dept. of Syst. & Control, Nat. Lab. for Sci. Comput., Petropolis
  • fYear
    2008
  • fDate
    11-13 June 2008
  • Firstpage
    4238
  • Lastpage
    4243
  • Abstract
    This paper deals with approximate value iteration (AVI) algorithms applied to discounted dynamic (DP) programming problems. The so-called Bellman residual is shown to be convex in the Banach space of candidate solutions to the DP problem. This fact motivates the introduction of an AVI algorithm with local search that seeks an approximate solution in a lower dimensional space called approximation architecture. The optimality of a point in the approximation architecture is characterized by means of convex optimization concepts and necessary and sufficient conditions to global optimality are derived. To illustrate the method, two examples are presented which were previously explored in the literature.
  • Keywords
    Banach spaces; approximation theory; convex programming; dynamic programming; iterative methods; Banach space; Bellman residual; approximate value iteration algorithm; convex optimization concept; dynamic programming; Approximation algorithms; Computer architecture; Convergence; Dynamic programming; Function approximation; Heuristic algorithms; Large-scale systems; Robustness; Stochastic processes; Sufficient conditions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    American Control Conference, 2008
  • Conference_Location
    Seattle, WA
  • ISSN
    0743-1619
  • Print_ISBN
    978-1-4244-2078-0
  • Electronic_ISBN
    0743-1619
  • Type

    conf

  • DOI
    10.1109/ACC.2008.4587159
  • Filename
    4587159