• DocumentCode
    1798017
  • Title

    Approximate model-assisted Neural Fitted Q-Iteration

  • Author

    Lampe, Thomas ; Riedmiller, Martin

  • Author_Institution
    Dept. of Comput. Sci., Albert-Ludwigs-Univ., Freiburg, Germany
  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    2698
  • Lastpage
    2704
  • Abstract
    In this work, we propose an extension to the Neural Fitted Q-Iteration algorithm that utilizes a learned model to generate virtual trajectories which are used for updating the Q-function. Compared to standard NFQ, this combination has the potential to greatly reduce the amount of system interaction required to learn a good policy. At the same time, the approach still maintains the generalization ability of Q-learning. We provide a general formulation for approximate model-assisted fitted Q-learning, and examine the advantages of its neural implementation regarding interaction time and robustness. Its capabilities are illustrated with first results on a benchmark cart-pole regulation task, on which our method turns out to provide more general policies using much less interaction time.
  • Keywords
    learning (artificial intelligence); neural nets; Q-function; approximate model-assisted neural fitted Q-iteration; benchmark cart-pole regulation task; generalization ability; learned model; neural implementation; standard NFQ; virtual trajectories; Approximation algorithms; Data models; Robustness; Standards; Testing; Training; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), 2014 International Joint Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-6627-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2014.6889733
  • Filename
    6889733