• DocumentCode
    1561566
  • Title

    A new multi-agent reinforcement learning algorithm and its application in wastewater reclamation by IBAC reactor

  • Author

    Yang, Haiyan ; Ma, Fang ; Cui, Fuyi ; Zhong, Yu

  • Author_Institution
    Sch. of Municipal & Environ. Eng., Harbin Inst. of Technol., China
  • Volume
    3
  • fYear
    2004
  • Firstpage
    2671
  • Abstract
    In multi-agent systems, joint-action must be employed to achieve cooperation because the evaluation to the behavior of an agent often depends on the other agents´ behaviors. However, joint-action reinforcement learning suffers the slow convergence rate because of the enormous learning space produced by joint-action. In this article, a prediction-based reinforcement learning algorithm is presented for multi-agent cooperation tasks, which demands all agents to learn predicting the probabilities of actions that other agents may execute. An Immobilized Biological Activated Carbon (IBAC) reactor is run to test the efficacy of the new algorithm, and the result shows that the new algorithm can achieve high biodegradation efficiency much faster than the primitive reinforcement learning algorithm.
  • Keywords
    adaptive control; bioreactors; convergence; intelligent control; learning (artificial intelligence); learning systems; multi-agent systems; probability; wastewater treatment; biodegradation efficiency; convergence rate; immobilized biological activated carbon reactor; joint action reinforcement learning; multiagent cooperation tasks; multiagent reinforcement learning; multiagent systems; probability; wastewater reclamation; Acceleration; Collaboration; Convergence; Inductors; Machine learning algorithms; Multiagent systems; Prediction algorithms; Space technology; Testing; Wastewater;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Control and Automation, 2004. WCICA 2004. Fifth World Congress on
  • Print_ISBN
    0-7803-8273-0
  • Type

    conf

  • DOI
    10.1109/WCICA.2004.1342082
  • Filename
    1342082