DocumentCode :
2584217
Title :
Improving reinforcement learning using temporal-difference network EUROCON2009
Author :
Karbasian, Habib ; Ahmadabadi, Majid N. ; Araabi, Babak N.
Author_Institution :
Control & Intell. Process. Center of Excellence, Univ. of Tehran, Tehran, Iran
fYear :
2009
fDate :
18-23 May 2009
Firstpage :
1716
Lastpage :
1722
Abstract :
Reinforcement learning has been one of popular learning methods for many problems in many different domains. The important point for this method is how fast and efficient it is to learn a new problem. In this paper, we present a new approach to increase the efficiency of the reinforcement learning method with the great help of a predictive model of the problem´s environment called temporal-difference network along with observation. This TD network is nourished with the knowledge extracted from another problem with the same task using TD network. First a reinforcement-learning agent tries to learn its environment for the task of wall following. After that we train temporal-difference network (TDN) with intervening observation in the brain of the agent in order to gain a predictive model of the environment. Later the most promising sequences of action-observation of the given environment will be extracted as knowledge to strengthen the reinforcement learning problem in a new environment. Finally this knowledge helps the reinforcement procedure to produce more efficient results.
Keywords :
Markov processes; decision theory; intelligent robots; knowledge acquisition; learning (artificial intelligence); learning systems; mobile robots; predictive control; POMDP; TD network; knowledge extraction; partially observable Markov decision process; predictive model; reinforcement learning agent method; robot wall-following; temporal-difference network; Intelligent agent; Intelligent control; Intelligent robots; Intelligent sensors; Learning systems; Mathematics; Physics computing; Predictive models; Process control; Robot sensing systems; Concept; MDP; POMDP; Reinforcement Learning; Temporal-Difference Network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
EUROCON 2009, EUROCON '09. IEEE
Conference_Location :
St.-Petersburg
Print_ISBN :
978-1-4244-3860-0
Electronic_ISBN :
978-1-4244-3861-7
Type :
conf
DOI :
10.1109/EURCON.2009.5167875
Filename :
5167875
Link To Document :
بازگشت