• DocumentCode
    1883375
  • Title

    A study of reinforcement learning for the robot with many degrees of freedom - acquisition of locomotion patterns for multi-legged robot

  • Author

    Ito, Kazuyuki ; Matsuno, Fumitoshi

  • Author_Institution
    Dept. of Computational Intelligence & Syst. Sci., Tokyo Inst. of Technol., Yokohama, Japan
  • Volume
    4
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    3392
  • Abstract
    Reinforcement learning has recently been receiving much attention as a learning method for not only toy problems but also complicated systems such as robot systems. It does not need priori knowledge and has higher capability of reactive and adaptive behaviors. However, increasing of action-state space makes it difficult to accomplish the learning process. In most of the previous works, the application of the learning is restricted to simple tasks with a small action-state space. Considering this point, we present a new reinforcement learning algorithm: Q-learning with dynamic structuring of exploration space based on genetic algorithm. The algorithm is applicable to systems with high dimensional action and interior state spaces, for example, a robot with many redundant degrees of freedom. To demonstrate the effectiveness of the proposed algorithm simulations of locomotion patterns for a 12-leged robot were carried out. As the result, an effective behavior was obtained by using our proposed algorithm.
  • Keywords
    genetic algorithms; learning (artificial intelligence); legged locomotion; robot dynamics; Q-learning; action-state space; dynamics; genetic algorithm; legged locomotion; locomotion pattern; multiple legged robot; reinforcement learning; Computational intelligence; Genetic algorithms; Heuristic algorithms; Indium tin oxide; Intelligent robots; Learning systems; Legged locomotion; Orbital robotics; Space exploration; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation, 2002. Proceedings. ICRA '02. IEEE International Conference on
  • Print_ISBN
    0-7803-7272-7
  • Type

    conf

  • DOI
    10.1109/ROBOT.2002.1014235
  • Filename
    1014235