مرکز منطقه ای اطلاع رساني علوم و فناوري - Improving reinforcement learning using temporal-difference network EUROCON2009

DocumentCode :

2584217

Title :

Improving reinforcement learning using temporal-difference network EUROCON2009

Author :

Karbasian, Habib ; Ahmadabadi, Majid N. ; Araabi, Babak N.

Author_Institution :

Control & Intell. Process. Center of Excellence, Univ. of Tehran, Tehran, Iran

fYear :

2009

fDate :

18-23 May 2009

Firstpage :

1716

Lastpage :

1722

Abstract :

Reinforcement learning has been one of popular learning methods for many problems in many different domains. The important point for this method is how fast and efficient it is to learn a new problem. In this paper, we present a new approach to increase the efficiency of the reinforcement learning method with the great help of a predictive model of the problem´s environment called temporal-difference network along with observation. This TD network is nourished with the knowledge extracted from another problem with the same task using TD network. First a reinforcement-learning agent tries to learn its environment for the task of wall following. After that we train temporal-difference network (TDN) with intervening observation in the brain of the agent in order to gain a predictive model of the environment. Later the most promising sequences of action-observation of the given environment will be extracted as knowledge to strengthen the reinforcement learning problem in a new environment. Finally this knowledge helps the reinforcement procedure to produce more efficient results.

Keywords :

Markov processes; decision theory; intelligent robots; knowledge acquisition; learning (artificial intelligence); learning systems; mobile robots; predictive control; POMDP; TD network; knowledge extraction; partially observable Markov decision process; predictive model; reinforcement learning agent method; robot wall-following; temporal-difference network; Intelligent agent; Intelligent control; Intelligent robots; Intelligent sensors; Learning systems; Mathematics; Physics computing; Predictive models; Process control; Robot sensing systems; Concept; MDP; POMDP; Reinforcement Learning; Temporal-Difference Network;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

EUROCON 2009, EUROCON '09. IEEE

Conference_Location :

St.-Petersburg

Print_ISBN :

978-1-4244-3860-0

Electronic_ISBN :

978-1-4244-3861-7

Type :

conf

DOI :

10.1109/EURCON.2009.5167875

Filename :

5167875

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2584217