Extended stochastic reinforcement learning for the acquisition of cooperative motion plans for dynamically constrained agents

Author

Mikami, Sadayoshi ; Kakazu, Yukinori

Author_Institution

Fac. of Eng., Hokkaido Univ., Sapporo, Japan

fYear

1993

fDate

17-20 Oct 1993

Firstpage

257

Abstract

This paper examines the problem of the acquisition of coordinated task plans for a group of autonomous agents. The authors deal with cases when intelligent mobile robots are on a seesaw and they are trying to balance the seesaw. The objective of the agents is to maximize the global optimization function under the constraints that the effect of their decision is propagated after a certain time delay. To cope with such a situation, this paper proposes extensions to the learning automata type reinforcement learning methods. One is the group learning method. It generates teaching signals that are robust to the delay of the result of an action. Another is the genetic reinforcement learning phase. This is intended to give hard-wired knowledge to all the agents through the meta-learning phase. A sensor information compression function is acquired as the knowledge and a genetic algorithm is used for the search mechanism. The authors demonstrate how the cooperative plans can be acquired for seesaw balancing problem where conventional reinforcement learning could not achieve its balance

Keywords

cooperative systems; genetic algorithms; learning (artificial intelligence); learning automata; mobile robots; optimisation; stochastic automata; autonomous agents; cooperative motion plans; coordinated task plans; dynamically constrained agents; extended stochastic reinforcement learning; genetic reinforcement learning; global optimization function; group learning method; hard-wired knowledge; intelligent mobile robots; learning automata; meta-learning phase; seesaw balancing problem; sensor information compression function; teaching signals; Autonomous agents; Constraint optimization; Delay effects; Education; Intelligent robots; Learning automata; Learning systems; Mobile robots; Signal generators; Stochastic processes;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man and Cybernetics, 1993. 'Systems Engineering in the Service of Humans', Conference Proceedings., International Conference on

Conference_Location

Le Touquet

Print_ISBN

0-7803-0911-1

Type

conf

DOI

10.1109/ICSMC.1993.390719

Filename

390719