Efficient Value Function Approximation with Unsupervised Hierarchical Categorization for a Reinforcement Learning Agent

Author

Wang, Yongjia ; Laird, John E.

Author_Institution

EECS Dept., Univ. of Michigan, Ann Arbor, MI, USA

Volume

2

fYear

2010

fDate

Aug. 31 2010-Sept. 3 2010

Firstpage

197

Lastpage

204

Abstract

We investigate the problem of reinforcement learning (RL) in a challenging object-oriented environment, where the functional diversity of objects is high, and the agent must learn quickly by generalizing its experience to novel situations. We present a novel two-layer architecture, which can achieve efficient learning of value function for such environments. The algorithm is implemented by integrating an unsupervised, hierarchical clustering component into the Soar cognitive architecture. Our system coherently incorporates several principles in machine learning and knowledge representation including: dimension reduction, competitive learning, hierarchical representation and sparse coding. We also explore the types of prior domain knowledge that can be used to regulate learning based on the characteristics of environment. The system is empirically evaluated in an artificial domain consisting of interacting objects with diverse functional properties and multiple functional roles. The results demonstrate that the flexibility of hierarchical representation naturally integrates with our novel value function approximation scheme and together they can significantly improve the speed of RL.

Keywords

function approximation; knowledge representation; unsupervised learning; Soar cognitive architecture; competitive learning; dimension reduction; hierarchical clustering; hierarchical representation; knowledge representation; machine learning; object-oriented environment; reinforcement learning agent; sparse coding; unsupervised hierarchical categorization; value function approximation; Function approximation; Lattices; Learning; Learning systems; Sensitivity; Sensors; Weapons; cognitive architecture; reinforcement learnign; unsupervised learning; value function approximation;

fLanguage

English

Publisher

ieee

Conference_Titel

Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010 IEEE/WIC/ACM International Conference on

Conference_Location

Toronto, ON

Print_ISBN

978-1-4244-8482-9

Electronic_ISBN

978-0-7695-4191-4

Type

conf

DOI

10.1109/WI-IAT.2010.16

Filename

5614166