Title :
Swarm reinforcement learning for traffic signal control based on cooperative multi-agent framework
Author :
Tahifa, Mohammed ; Boumhidi, Jaouad ; Yahyaouy, Ali
Author_Institution :
Comput. Sci. Dept., Sidi Mohamed Ben Abdellah Univ., Fez, Morocco
Abstract :
Congestion, accidents, pollution, and many other problems resulting from urban traffic are present every day in most cities around the world. The growing number of traffic lights in intersections needs efficient control, and hence, automatic systems are essential nowadays for optimally tackling this task. Agent based technologies and reinforcements learning are largely used for modelling and controlling intelligent transportation systems, where agents represent a traffic signal controller. Each agent learns to achieve its goal through many episodes. With a complicated learning problem, it may take much computation time to acquire the optimal policy. In this paper, we use a population based methods such as particle swarm optimization to be able to find rapidly the global optimal solution for multimodal functions with wide solution space. Agents learn through not only on their respective experiences, but also by exchanging information among them, simulation results show that the swarm Q-learning surpass the simple Q-learning causing less average delay time and higher flow rate.
Keywords :
learning (artificial intelligence); multi-agent systems; road traffic control; swarm intelligence; traffic engineering computing; cooperative multiagent system; multimodal function; population based method; swarm Q-learning; swarm reinforcement learning; traffic signal control; Computer architecture; Junctions; Learning (artificial intelligence); Multi-agent systems; Particle swarm optimization; Roads; Vehicles; Particle swarm optimization; Q-learning; multi-agent systems; reinforcement learning; traffic signal control;
Conference_Titel :
Intelligent Systems and Computer Vision (ISCV), 2015
Conference_Location :
Fez
Print_ISBN :
978-1-4799-7510-5
DOI :
10.1109/ISACV.2015.7105536