Title :
Extended stochastic reinforcement learning for the acquisition of cooperative motion plans for dynamically constrained agents
Author :
Mikami, Sadayoshi ; Kakazu, Yukinori
Author_Institution :
Fac. of Eng., Hokkaido Univ., Sapporo, Japan
Abstract :
This paper examines the problem of the acquisition of coordinated task plans for a group of autonomous agents. The authors deal with cases when intelligent mobile robots are on a seesaw and they are trying to balance the seesaw. The objective of the agents is to maximize the global optimization function under the constraints that the effect of their decision is propagated after a certain time delay. To cope with such a situation, this paper proposes extensions to the learning automata type reinforcement learning methods. One is the group learning method. It generates teaching signals that are robust to the delay of the result of an action. Another is the genetic reinforcement learning phase. This is intended to give hard-wired knowledge to all the agents through the meta-learning phase. A sensor information compression function is acquired as the knowledge and a genetic algorithm is used for the search mechanism. The authors demonstrate how the cooperative plans can be acquired for seesaw balancing problem where conventional reinforcement learning could not achieve its balance
Keywords :
cooperative systems; genetic algorithms; learning (artificial intelligence); learning automata; mobile robots; optimisation; stochastic automata; autonomous agents; cooperative motion plans; coordinated task plans; dynamically constrained agents; extended stochastic reinforcement learning; genetic reinforcement learning; global optimization function; group learning method; hard-wired knowledge; intelligent mobile robots; learning automata; meta-learning phase; seesaw balancing problem; sensor information compression function; teaching signals; Autonomous agents; Constraint optimization; Delay effects; Education; Intelligent robots; Learning automata; Learning systems; Mobile robots; Signal generators; Stochastic processes;
Conference_Titel :
Systems, Man and Cybernetics, 1993. 'Systems Engineering in the Service of Humans', Conference Proceedings., International Conference on
Conference_Location :
Le Touquet
Print_ISBN :
0-7803-0911-1
DOI :
10.1109/ICSMC.1993.390719