• DocumentCode
    530999
  • Title

    Efficient Value Function Approximation with Unsupervised Hierarchical Categorization for a Reinforcement Learning Agent

  • Author

    Wang, Yongjia ; Laird, John E.

  • Author_Institution
    EECS Dept., Univ. of Michigan, Ann Arbor, MI, USA
  • Volume
    2
  • fYear
    2010
  • fDate
    Aug. 31 2010-Sept. 3 2010
  • Firstpage
    197
  • Lastpage
    204
  • Abstract
    We investigate the problem of reinforcement learning (RL) in a challenging object-oriented environment, where the functional diversity of objects is high, and the agent must learn quickly by generalizing its experience to novel situations. We present a novel two-layer architecture, which can achieve efficient learning of value function for such environments. The algorithm is implemented by integrating an unsupervised, hierarchical clustering component into the Soar cognitive architecture. Our system coherently incorporates several principles in machine learning and knowledge representation including: dimension reduction, competitive learning, hierarchical representation and sparse coding. We also explore the types of prior domain knowledge that can be used to regulate learning based on the characteristics of environment. The system is empirically evaluated in an artificial domain consisting of interacting objects with diverse functional properties and multiple functional roles. The results demonstrate that the flexibility of hierarchical representation naturally integrates with our novel value function approximation scheme and together they can significantly improve the speed of RL.
  • Keywords
    function approximation; knowledge representation; unsupervised learning; Soar cognitive architecture; competitive learning; dimension reduction; hierarchical clustering; hierarchical representation; knowledge representation; machine learning; object-oriented environment; reinforcement learning agent; sparse coding; unsupervised hierarchical categorization; value function approximation; Function approximation; Lattices; Learning; Learning systems; Sensitivity; Sensors; Weapons; cognitive architecture; reinforcement learnign; unsupervised learning; value function approximation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010 IEEE/WIC/ACM International Conference on
  • Conference_Location
    Toronto, ON
  • Print_ISBN
    978-1-4244-8482-9
  • Electronic_ISBN
    978-0-7695-4191-4
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2010.16
  • Filename
    5614166