DocumentCode
1561566
Title
A new multi-agent reinforcement learning algorithm and its application in wastewater reclamation by IBAC reactor
Author
Yang, Haiyan ; Ma, Fang ; Cui, Fuyi ; Zhong, Yu
Author_Institution
Sch. of Municipal & Environ. Eng., Harbin Inst. of Technol., China
Volume
3
fYear
2004
Firstpage
2671
Abstract
In multi-agent systems, joint-action must be employed to achieve cooperation because the evaluation to the behavior of an agent often depends on the other agents´ behaviors. However, joint-action reinforcement learning suffers the slow convergence rate because of the enormous learning space produced by joint-action. In this article, a prediction-based reinforcement learning algorithm is presented for multi-agent cooperation tasks, which demands all agents to learn predicting the probabilities of actions that other agents may execute. An Immobilized Biological Activated Carbon (IBAC) reactor is run to test the efficacy of the new algorithm, and the result shows that the new algorithm can achieve high biodegradation efficiency much faster than the primitive reinforcement learning algorithm.
Keywords
adaptive control; bioreactors; convergence; intelligent control; learning (artificial intelligence); learning systems; multi-agent systems; probability; wastewater treatment; biodegradation efficiency; convergence rate; immobilized biological activated carbon reactor; joint action reinforcement learning; multiagent cooperation tasks; multiagent reinforcement learning; multiagent systems; probability; wastewater reclamation; Acceleration; Collaboration; Convergence; Inductors; Machine learning algorithms; Multiagent systems; Prediction algorithms; Space technology; Testing; Wastewater;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Control and Automation, 2004. WCICA 2004. Fifth World Congress on
Print_ISBN
0-7803-8273-0
Type
conf
DOI
10.1109/WCICA.2004.1342082
Filename
1342082
Link To Document