DocumentCode
530999
Title
Efficient Value Function Approximation with Unsupervised Hierarchical Categorization for a Reinforcement Learning Agent
Author
Wang, Yongjia ; Laird, John E.
Author_Institution
EECS Dept., Univ. of Michigan, Ann Arbor, MI, USA
Volume
2
fYear
2010
fDate
Aug. 31 2010-Sept. 3 2010
Firstpage
197
Lastpage
204
Abstract
We investigate the problem of reinforcement learning (RL) in a challenging object-oriented environment, where the functional diversity of objects is high, and the agent must learn quickly by generalizing its experience to novel situations. We present a novel two-layer architecture, which can achieve efficient learning of value function for such environments. The algorithm is implemented by integrating an unsupervised, hierarchical clustering component into the Soar cognitive architecture. Our system coherently incorporates several principles in machine learning and knowledge representation including: dimension reduction, competitive learning, hierarchical representation and sparse coding. We also explore the types of prior domain knowledge that can be used to regulate learning based on the characteristics of environment. The system is empirically evaluated in an artificial domain consisting of interacting objects with diverse functional properties and multiple functional roles. The results demonstrate that the flexibility of hierarchical representation naturally integrates with our novel value function approximation scheme and together they can significantly improve the speed of RL.
Keywords
function approximation; knowledge representation; unsupervised learning; Soar cognitive architecture; competitive learning; dimension reduction; hierarchical clustering; hierarchical representation; knowledge representation; machine learning; object-oriented environment; reinforcement learning agent; sparse coding; unsupervised hierarchical categorization; value function approximation; Function approximation; Lattices; Learning; Learning systems; Sensitivity; Sensors; Weapons; cognitive architecture; reinforcement learnign; unsupervised learning; value function approximation;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010 IEEE/WIC/ACM International Conference on
Conference_Location
Toronto, ON
Print_ISBN
978-1-4244-8482-9
Electronic_ISBN
978-0-7695-4191-4
Type
conf
DOI
10.1109/WI-IAT.2010.16
Filename
5614166
Link To Document