• DocumentCode
    447258
  • Title

    An actor-critic approach for learning cooperative behaviors of multiagent seesaw balancing problems

  • Author

    Kawakami, Takashi ; Kinoshita, Masahiro ; Watanabe, Michiko ; Takatori, Norihiko ; Furukawa, Masashi

  • Author_Institution
    Dept. of Inf. Design, Hokkaido Inst. of Technol., Sapporo, Japan
  • Volume
    1
  • fYear
    2005
  • fDate
    10-12 Oct. 2005
  • Firstpage
    109
  • Abstract
    This paper proposes a new approach to realize a reinforcement learning scheme for autonomous multiple agents system. In our approach, we treat the cooperative agents systems in which there are multiple autonomous mobile robots, and the seesaw balancing task is given. This problem is an example of corresponding tasks to find the appropriate locations for multiple mobile robots. Each robot agent on a seesaw keeps being balanced state. As a most useful algorithm, the Q-learning method is well known. However, feasible action values of robot agents must be categorized into some discrete action values. Therefore, in this study, the actor-critic method is applied to treat continuous values of agents´ actions. Each robot agent has a set of normal distribution, that determines a distance of the robot movement for a corresponding state of the seesaw system. Based on a result of movement in this system, the normal distribution is modified by actor-critic learning method. The simulation result shows the effectiveness of our approaching method.
  • Keywords
    learning (artificial intelligence); mobile robots; multi-agent systems; normal distribution; Q-learning; actor-critic approach; autonomous multiple agent; cooperative behavior learning; multiagent seesaw balancing problem; multiagent systems; multiple autonomous mobile robot; multiple mobile robot; reinforcement learning; robot agent; robot movement; Autonomous agents; Educational institutions; Environmental management; Gaussian distribution; Human robot interaction; Learning systems; Mobile robots; Multiagent systems; Positron emission tomography; Service robots; Actor-critic; cooperative behaviors; multiagent systems; reinforcement learning; seesaw balancing problems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9298-1
  • Type

    conf

  • DOI
    10.1109/ICSMC.2005.1571130
  • Filename
    1571130