DocumentCode :
2584759
Title :
Concept extraction using temporal-difference network EUROCON2009
Author :
Karbasian, Habib ; Ahmadabadi, Majid N. ; Araabi, Babak N.
Author_Institution :
Control & Intell. Process. Center of Excellence, Univ. of Tehran, Tehran, Iran
fYear :
2009
fDate :
18-23 May 2009
Firstpage :
1888
Lastpage :
1894
Abstract :
In this paper, we propose a novel framework to extract temporally extended concepts in a grid world environment using a probable data structure named temporal-difference network. First a reinforcement-learning agent tries to learn its environment for the task of wall following. After that we train a newly introduced temporal-difference network (TDN) in the brain of the agent in order to gain a predictive model of the environment. At last the most promising sequences of action-observation of the given environment will be sorted out based on their probability.
Keywords :
Markov processes; decision theory; intelligent robots; learning (artificial intelligence); mobile robots; predictive control; probability; tree data structures; POMDP problem; action-observation sequence; concept extraction; grid world environment; partially-observable Markov decision process; predictive model; probability; probable tree data structure; reinforcement-learning agent; robot wall following task; temporal-difference network; Data mining; Data structures; Intelligent agent; Intelligent robots; Intelligent sensors; Learning; Predictive models; Process control; Robot sensing systems; Stochastic processes; Concept; MDP; POMDP; Reinforcement Learning; Temporal-Difference Network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
EUROCON 2009, EUROCON '09. IEEE
Conference_Location :
St.-Petersburg
Print_ISBN :
978-1-4244-3860-0
Electronic_ISBN :
978-1-4244-3861-7
Type :
conf
DOI :
10.1109/EURCON.2009.5167904
Filename :
5167904
Link To Document :
بازگشت