مرکز منطقه ای اطلاع رساني علوم و فناوري - Approximate model-assisted Neural Fitted Q-Iteration

DocumentCode :

1798017

Title :

Approximate model-assisted Neural Fitted Q-Iteration

Author :

Lampe, Thomas ; Riedmiller, Martin

Author_Institution :

Dept. of Comput. Sci., Albert-Ludwigs-Univ., Freiburg, Germany

fYear :

2014

fDate :

6-11 July 2014

Firstpage :

2698

Lastpage :

2704

Abstract :

In this work, we propose an extension to the Neural Fitted Q-Iteration algorithm that utilizes a learned model to generate virtual trajectories which are used for updating the Q-function. Compared to standard NFQ, this combination has the potential to greatly reduce the amount of system interaction required to learn a good policy. At the same time, the approach still maintains the generalization ability of Q-learning. We provide a general formulation for approximate model-assisted fitted Q-learning, and examine the advantages of its neural implementation regarding interaction time and robustness. Its capabilities are illustrated with first results on a benchmark cart-pole regulation task, on which our method turns out to provide more general policies using much less interaction time.

Keywords :

learning (artificial intelligence); neural nets; Q-function; approximate model-assisted neural fitted Q-iteration; benchmark cart-pole regulation task; generalization ability; learned model; neural implementation; standard NFQ; virtual trajectories; Approximation algorithms; Data models; Robustness; Standards; Testing; Training; Trajectory;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), 2014 International Joint Conference on

Conference_Location :

Beijing

Print_ISBN :

978-1-4799-6627-1

Type :

conf

DOI :

10.1109/IJCNN.2014.6889733

Filename :

6889733

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1798017