DocumentCode :
397875
Title :
Q-ac: multiagent reinforcement learning with perception-conversion action
Author :
Sun, Ruoying ; Tatsum, Shoji ; Zhao, Gang
Author_Institution :
Fac. of Eng., Osaka City Univ., Japan
Volume :
3
fYear :
2003
fDate :
5-8 Oct. 2003
Firstpage :
2950
Abstract :
For the task under Markov Decision Process, this paper presents a novel multiagent Reinforcement Learning (RL) with perception and conversion action mechanism that learning agents observe adversary agent and convert adversarial action to learning agents´ corresponding action as observing state variation incurred by the adversary agent in the task environment during learning processes. Meanwhile, this paper surveys inexpensive communication ways among learning agents utilizing both the direct communication and the indirect media communication to realize agents´ cooperation. The direct communication is realized by sharing sensation; the indirect media communication is realized by updating reinforcement values on the common environment observation. Then, a multiagent RL algorithm, Q-ac multiagent RL method, is proposed. By perception and conversion action, the learning agents extend learning episodes and derive more observation by less action. The direct communication enhances agents´ observation ability to the environment, and the indirect media communication improves agents´ ability deriving the optimal action policy. The simulation results on hunter game demonstrate the efficiency of the proposed method.
Keywords :
Markov processes; learning (artificial intelligence); multi-agent systems; Markov decision process; Q-ac multiagent RL method; Q-ac multiagent reinforcement learning method; hunter game; media communication; perception-conversion action mechanism; Control system synthesis; Delay; Design methodology; Learning systems; Stochastic processes; Sun;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2003. IEEE International Conference on
ISSN :
1062-922X
Print_ISBN :
0-7803-7952-7
Type :
conf
DOI :
10.1109/ICSMC.2003.1244340
Filename :
1244340
Link To Document :
بازگشت