DocumentCode :
1734397
Title :
Coordinated Reinforcement Learning Agents in a Multi-agent Virtual Environment
Author :
Sause, William
Author_Institution :
Grad. Sch. of Comput. & Inf. Sci., Nova Southeastern Univ., Fort Lauderdale, FL, USA
Volume :
1
fYear :
2013
Firstpage :
227
Lastpage :
230
Abstract :
This research presents a framework for coordinating multiple intelligent agents within a single virtual environment. Coordination is accomplished via a "next available agent" scheme while learning is achieved through the use of the Q-learning and Sarsa temporal difference reinforcement learning algorithms. To assess the effectiveness of each learning algorithm, experiments were conducted that measured an agent\´s ability to learn tasks in a static and dynamic environment while using both a fixed (FEP) and variable (VEP) ϵ-greedy probability rate. Results show that Sarsa, on average, outperformed Q-learning in almost all experiments. Overall, VEP resulted in higher percentages of successes and optimal successes than FEP, and showed convergence to the optimal policy when measuring the average number of time steps per episode.
Keywords :
convergence; greedy algorithms; learning (artificial intelligence); multi-agent systems; probability; FEP; Q-learning algorithm; Sarsa temporal difference reinforcement learning algorithm; VEP; agent task learning ability measurement; convergence; coordinated reinforcement learning agents; dynamic environment; fixed ϵ-greedy probability rate; multiagent virtual environment; multiple intelligent agent coordination; next-available agent scheme; optimal policy; static environment; variable ϵ-greedy probability rate; Convergence; Educational institutions; Heuristic algorithms; Intelligent agents; Learning (artificial intelligence); Time measurement; Virtual environments; Reinforcement learning; intelligent agents; virtual environments;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2013 12th International Conference on
Conference_Location :
Miami, FL
Type :
conf
DOI :
10.1109/ICMLA.2013.46
Filename :
6784616
Link To Document :
بازگشت