• DocumentCode
    290676
  • Title

    Extended stochastic reinforcement learning for the acquisition of cooperative motion plans for dynamically constrained agents

  • Author

    Mikami, Sadayoshi ; Kakazu, Yukinori

  • Author_Institution
    Fac. of Eng., Hokkaido Univ., Sapporo, Japan
  • fYear
    1993
  • fDate
    17-20 Oct 1993
  • Firstpage
    257
  • Abstract
    This paper examines the problem of the acquisition of coordinated task plans for a group of autonomous agents. The authors deal with cases when intelligent mobile robots are on a seesaw and they are trying to balance the seesaw. The objective of the agents is to maximize the global optimization function under the constraints that the effect of their decision is propagated after a certain time delay. To cope with such a situation, this paper proposes extensions to the learning automata type reinforcement learning methods. One is the group learning method. It generates teaching signals that are robust to the delay of the result of an action. Another is the genetic reinforcement learning phase. This is intended to give hard-wired knowledge to all the agents through the meta-learning phase. A sensor information compression function is acquired as the knowledge and a genetic algorithm is used for the search mechanism. The authors demonstrate how the cooperative plans can be acquired for seesaw balancing problem where conventional reinforcement learning could not achieve its balance
  • Keywords
    cooperative systems; genetic algorithms; learning (artificial intelligence); learning automata; mobile robots; optimisation; stochastic automata; autonomous agents; cooperative motion plans; coordinated task plans; dynamically constrained agents; extended stochastic reinforcement learning; genetic reinforcement learning; global optimization function; group learning method; hard-wired knowledge; intelligent mobile robots; learning automata; meta-learning phase; seesaw balancing problem; sensor information compression function; teaching signals; Autonomous agents; Constraint optimization; Delay effects; Education; Intelligent robots; Learning automata; Learning systems; Mobile robots; Signal generators; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 1993. 'Systems Engineering in the Service of Humans', Conference Proceedings., International Conference on
  • Conference_Location
    Le Touquet
  • Print_ISBN
    0-7803-0911-1
  • Type

    conf

  • DOI
    10.1109/ICSMC.1993.390719
  • Filename
    390719