مرکز منطقه ای اطلاع رساني علوم و فناوري - An on-line algorithm for dynamic reinforcement learning and planning in reactive environments

DocumentCode :

2695559

Title :

An on-line algorithm for dynamic reinforcement learning and planning in reactive environments

Author :

Schmidhuber, Jürgen

fYear :

1990

fDate :

17-21 June 1990

Firstpage :

253

Abstract :

An online learning algorithm for reinforcement learning with continually running recurrent networks in nonstationary reactive environments is described. Various kinds of reinforcement are considered as special types of input to an agent living in the environment. The agent´s only goal is to maximize the amount of reinforcement received over time. Supervised learning techniques for recurrent networks serve to construct a differentiable model of the environmental dynamics which includes a model of future reinforcement. This model is used for learning goal-directed behavior in an online fashion. The possibility of using the system for planning future action sequences is investigated and this approach is compared to approaches based on temporal difference methods. A connection to metalearning (learning how to learn) is noted

Keywords :

learning systems; neural nets; parallel algorithms; continually running recurrent networks; differentiable model; environmental dynamics; future reinforcement; learning goal-directed behavior; metalearning; nonstationary reactive environments; online learning algorithm; planning future action sequences; recurrent networks; reinforcement learning; temporal difference methods;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks, 1990., 1990 IJCNN International Joint Conference on

Conference_Location :

San Diego, CA, USA

Type :

conf

DOI :

10.1109/IJCNN.1990.137723

Filename :

5726682

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2695559