DocumentCode
290676
Title
Extended stochastic reinforcement learning for the acquisition of cooperative motion plans for dynamically constrained agents
Author
Mikami, Sadayoshi ; Kakazu, Yukinori
Author_Institution
Fac. of Eng., Hokkaido Univ., Sapporo, Japan
fYear
1993
fDate
17-20 Oct 1993
Firstpage
257
Abstract
This paper examines the problem of the acquisition of coordinated task plans for a group of autonomous agents. The authors deal with cases when intelligent mobile robots are on a seesaw and they are trying to balance the seesaw. The objective of the agents is to maximize the global optimization function under the constraints that the effect of their decision is propagated after a certain time delay. To cope with such a situation, this paper proposes extensions to the learning automata type reinforcement learning methods. One is the group learning method. It generates teaching signals that are robust to the delay of the result of an action. Another is the genetic reinforcement learning phase. This is intended to give hard-wired knowledge to all the agents through the meta-learning phase. A sensor information compression function is acquired as the knowledge and a genetic algorithm is used for the search mechanism. The authors demonstrate how the cooperative plans can be acquired for seesaw balancing problem where conventional reinforcement learning could not achieve its balance
Keywords
cooperative systems; genetic algorithms; learning (artificial intelligence); learning automata; mobile robots; optimisation; stochastic automata; autonomous agents; cooperative motion plans; coordinated task plans; dynamically constrained agents; extended stochastic reinforcement learning; genetic reinforcement learning; global optimization function; group learning method; hard-wired knowledge; intelligent mobile robots; learning automata; meta-learning phase; seesaw balancing problem; sensor information compression function; teaching signals; Autonomous agents; Constraint optimization; Delay effects; Education; Intelligent robots; Learning automata; Learning systems; Mobile robots; Signal generators; Stochastic processes;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 1993. 'Systems Engineering in the Service of Humans', Conference Proceedings., International Conference on
Conference_Location
Le Touquet
Print_ISBN
0-7803-0911-1
Type
conf
DOI
10.1109/ICSMC.1993.390719
Filename
390719
Link To Document