• DocumentCode
    2253470
  • Title

    Approximate dynamic programming using support vector regression

  • Author

    Bethke, Brett ; How, Jonathan P. ; Ozdaglar, Asuman

  • Author_Institution
    Dept. of Aeronaut. & Astronaut., MIT, Cambridge, MA, USA
  • fYear
    2008
  • fDate
    9-11 Dec. 2008
  • Firstpage
    3811
  • Lastpage
    3816
  • Abstract
    This paper presents a new approximate policy iteration algorithm based on support vector regression (SVR). It provides an overview of commonly used cost approximation architectures in approximate dynamic programming problems, explains some difficulties encountered by these architectures, and argues that SVR-based architectures can avoid some of these difficulties. A key contribution of this paper is to present an extension of the SVR problem to carry out approximate policy iteration by forcing the Bellman error to zero at selected states. The algorithm does not require trajectory simulations to be performed and is able to utilize a rich set of basis functions in a computationally efficient way. Computational results for an example problem are shown.
  • Keywords
    approximation theory; dynamic programming; iterative methods; regression analysis; support vector machines; approximate dynamic programming; approximate policy iteration algorithm; support vector regression; Computational modeling; Computer architecture; Costs; Decision making; Dynamic programming; Finance; Function approximation; Neural networks; Space technology; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2008. CDC 2008. 47th IEEE Conference on
  • Conference_Location
    Cancun
  • ISSN
    0191-2216
  • Print_ISBN
    978-1-4244-3123-6
  • Electronic_ISBN
    0191-2216
  • Type

    conf

  • DOI
    10.1109/CDC.2008.4739322
  • Filename
    4739322