• DocumentCode
    3467060
  • Title

    Continuous valued Q-learning for vision-guided behavior acquisition

  • Author

    Takahashi, Yasutake ; Takeda, Masanori ; Asada, Minoru

  • Author_Institution
    Dept. of Adaptive Machine Syst., Osaka Univ., Japan
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    255
  • Lastpage
    260
  • Abstract
    Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to real robot tasks because of poor performance of learned behavior and a further problem of state space construction. This paper proposes a continuous valued Q-learning for real robot applications, which calculates the contribution values for estimating a continuous action value in order to make motion smooth and effective. The proposed method obtained a better performance of desired behavior than the conventional real-valued Q-learning method, with roughly quantized state and action. To show the validity of the method, we applied the method to a vision-guided mobile robot of which the task is to chase a ball. Although the task was simple, the performance was quite impressive. A further improvement is discussed
  • Keywords
    learning (artificial intelligence); mobile robots; motion control; robot vision; continuous action value; continuous valued Q-learning; mobile robots; motion control; reinforcement learning; vision-guided control; Dynamic programming; Learning; Mobile robots; Orbital robotics; Robot vision systems; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multisensor Fusion and Integration for Intelligent Systems, 1999. MFI '99. Proceedings. 1999 IEEE/SICE/RSJ International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    0-7803-5801-5
  • Type

    conf

  • DOI
    10.1109/MFI.1999.815999
  • Filename
    815999