• DocumentCode
    2751430
  • Title

    A hybrid architecture for function approximation

  • Author

    Osman, H.E.

  • Author_Institution
    Osman Comput. Intell. & Syst. Sci., Tokyo Inst. of Technol., Tokyo
  • fYear
    2008
  • fDate
    13-16 July 2008
  • Firstpage
    1103
  • Lastpage
    1108
  • Abstract
    This paper proposes a new architecture to build a hybrid value function estimation based on a combination of temporal-different (TD) and on-line variant of random forest (RF). We call this implementation random-TD. First RF is induced into on-line mode in order to deal with large state space and memory constraints, while state-action mapping is based on the Bellman error, or on the TD error. The approach iteratively improves its value function by exploiting only relevant parts of action space. We evaluate the potential of the proposed procedure in terms of a reduction in the Bellman error with extended empirical studies on high-dimensional control problems (Ailerons, Elevator, Kinematics, and Friedman). The results demonstrate that our approach can significantly improve the performance of TD methods and speed up learning process.
  • Keywords
    estimation theory; function approximation; learning (artificial intelligence); random processes; Bellman error; TD error; function approximation; hybrid value function estimation; memory constraint; random forest; random-temporal-different; reinforcement leaning; state space; state-action mapping; Classification tree analysis; Computational intelligence; Computer architecture; Decision trees; Function approximation; Learning; Paper technology; Radio frequency; Space technology; State-space methods; TD-learning; function approximation; random forests; reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Informatics, 2008. INDIN 2008. 6th IEEE International Conference on
  • Conference_Location
    Daejeon
  • ISSN
    1935-4576
  • Print_ISBN
    978-1-4244-2170-1
  • Electronic_ISBN
    1935-4576
  • Type

    conf

  • DOI
    10.1109/INDIN.2008.4618267
  • Filename
    4618267