A new multi-agent reinforcement learning algorithm and its application in wastewater reclamation by IBAC reactor

Author

Yang, Haiyan ; Ma, Fang ; Cui, Fuyi ; Zhong, Yu

Author_Institution

Sch. of Municipal & Environ. Eng., Harbin Inst. of Technol., China

Volume

3

fYear

2004

Firstpage

2671

Abstract

In multi-agent systems, joint actions must be employed to achieve cooperation, because the evaluation of one agent's behavior often depends on the behaviors of the other agents. However, joint-action reinforcement learning converges slowly because of the enormous learning space that joint actions produce. This article presents a prediction-based reinforcement learning algorithm for multi-agent cooperation tasks, in which every agent learns to predict the probabilities of the actions that the other agents may execute. An Immobilized Biological Activated Carbon (IBAC) reactor is run to test the efficacy of the new algorithm, and the results show that it achieves high biodegradation efficiency much faster than the primitive reinforcement learning algorithm.

Keywords

adaptive control; bioreactors; convergence; intelligent control; learning (artificial intelligence); learning systems; multi-agent systems; probability; wastewater treatment; biodegradation efficiency; convergence rate; immobilized biological activated carbon reactor; joint action reinforcement learning; multiagent cooperation tasks; multiagent reinforcement learning; multiagent systems; probability; wastewater reclamation; Acceleration; Collaboration; Convergence; Inductors; Machine learning algorithms; Multiagent systems; Prediction algorithms; Space technology; Testing; Wastewater

fLanguage

English

Publisher

IEEE

Conference_Titel

Intelligent Control and Automation, 2004. WCICA 2004. Fifth World Congress on

Print_ISBN

0-7803-8273-0

Type

conf

DOI

10.1109/WCICA.2004.1342082

Filename

1342082