DocumentCode :
2709985
Title :
Learning to generate subgoals for action sequences
Author :
Schmidhuber, Jürgen
Author_Institution :
Inst. fur Inf., Tech. Univ. Muenchen, Germany
fYear :
1991
fDate :
8-14 Jul 1991
Abstract :
Summary form only given. None of the existing learning algorithms for neural networks in time-varying environments addresses the problems of learning to `divide and conquer´. Algorithms based on pure gradient descent or on adaptive critic methods are not suitable for dynamic control problems with long time lags between actions and consequences, and that there is a need for algorithms that perform `compositional learning´. The author discusses a system which solves at least one problem associated with compositional learning. The system learns to generate subgoals. This is done with the help of `time-bridging´ adaptive models that predict the effects of the system´s subprograms. An experiment on obstacle avoidance in a two-dimensional environment illustrates the approach
Keywords :
adaptive systems; learning systems; neural nets; planning (artificial intelligence); time-varying systems; action sequences; adaptive critic methods; compositional learning; divide and conquer method; dynamic control problems; gradient descent; learning algorithms; neural networks; obstacle avoidance; subgoals; subprogram effects prediction; time bridging adaptive models; time lags; time-varying environments; Adaptive control; Neural networks; Permission; Predictive models; Programmable control;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 1991., IJCNN-91-Seattle International Joint Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-0164-1
Type :
conf
DOI :
10.1109/IJCNN.1991.155375
Filename :
155375
Link To Document :
بازگشت