DocumentCode :
1906866
Title :
Automatic Discovery and Transfer of MAXQ Hierarchies in a Complex System
Author :
Hongbing Wang ; Wenya Li ; Xuan Zhou
Author_Institution :
Sch. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
Volume :
1
fYear :
2012
fDate :
7-9 Nov. 2012
Firstpage :
1157
Lastpage :
1162
Abstract :
Reinforcement learning has been an important category of machine learning approaches exhibiting self-learning and online learning characteristics. Using reinforcement learning, an agent can learn its behaviors through trial-and-error interactions with a dynamic environment and finally come up with an optimal strategy. Reinforcement learning suffers the curse of dimensionality, though there has been significant progress to overcome this issue in recent years. MAXQ is one of the most common approaches for reinforcement learning. To function properly, MAXQ requires a decomposition of the agent´s task into a task hierarchy. Previously, the decomposition can only be done manually. In this paper, we propose a mechanism for automatic subtask discovery. The mechanism applies clustering to automatically construct task hierarchy required by MAXQ, such that MAXQ can be fully automated. We present the design of our mechanism, and demonstrate its effectiveness through theoretical analysis and an extensive experimental evaluation.
Keywords :
multi-agent systems; optimisation; pattern clustering; unsupervised learning; MAXQ hierarchy; agent behavior; agent task decomposition; automatic discovery; automatic subtask discovery; clustering; complex system; dimensionality curse; dynamic environment; machine learning; online learning characteristics; optimal strategy; reinforcement learning; self-learning; task hierarchy; trial-and-error interaction; Bayes methods; Clustering algorithms; Complexity theory; Learning (artificial intelligence); Markov processes; Silicon; Trajectory; Clustering; MAXQ; Reinforcement Learning; System of Systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2012 IEEE 24th International Conference on
Conference_Location :
Athens
ISSN :
1082-3409
Print_ISBN :
978-1-4799-0227-9
Type :
conf
DOI :
10.1109/ICTAI.2012.165
Filename :
6495182
Link To Document :
بازگشت