• DocumentCode
    320660
  • Title

    Region-based Q-learning using convex clustering approach

  • Author

    Kim, S.H. ; Suh, I.H. ; Oh, S.R. ; Cho, Y.J. ; Chung, Y.K.

  • Author_Institution
    Intelligent Control & Robotics Lab., Hanyang Univ., Kyeongki, South Korea
  • Volume
    2
  • fYear
    1997
  • fDate
    7-11 Sep 1997
  • Firstpage
    601
  • Abstract
    For continuous state space applications, a novel method of Q-learning is proposed, where the method incorporates a region-based reward assignment being used to solve a structural credit assignment problem and a convex clustering approach to find a region with the same reward attribution property. Our learning method can estimate a current Q-value of an arbitrarily given state by using effect functions, and has the ability to learn its actions similar to that of Q-learning. Thus, our method enables robots to move smoothly in a real environment. To show the validity of our method, the proposed Q-learning method is compared with conventional Q-learning method through a simple two dimensional free space navigation problem, and visual tracking simulation results involving a 2-DOF SCARA robot are also presented
  • Keywords
    learning (artificial intelligence); path planning; pattern recognition; robots; 2-DOF SCARA robot; continuous state space; convex clustering approach; region-based Q-learning; region-based reward assignment; structural credit assignment problem; two dimensional free space navigation problem; visual tracking simulation; Discrete event simulation; Humans; Intelligent robots; Learning systems; Mechanical engineering; Mechanical factors; Motion planning; Orbital robotics; State estimation; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 1997. IROS '97., Proceedings of the 1997 IEEE/RSJ International Conference on
  • Conference_Location
    Grenoble
  • Print_ISBN
    0-7803-4119-8
  • Type

    conf

  • DOI
    10.1109/IROS.1997.655073
  • Filename
    655073