مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-agent reinforcement learning for traffic signal control

DocumentCode :

154890

Title :

Multi-agent reinforcement learning for traffic signal control

Author :

Prabuchandran, K.J. ; Hemanth Kumar, A.N. ; Bhatnagar, Shalabh

Author_Institution :

Dept. of Comput. Sci. & Autom., Indian Inst. of Sci., Bangalore, India

fYear :

2014

fDate :

8-11 Oct. 2014

Firstpage :

2529

Lastpage :

2534

Abstract :

Optimal control of traffic lights at junctions or traffic signal control (TSC) is essential for reducing the average delay experienced by the road users amidst the rapid increase in the usage of vehicles. In this paper, we formulate the TSC problem as a discounted cost Markov decision process (MDP) and apply multi-agent reinforcement learning (MARL) algorithms to obtain dynamic TSC policies. We model each traffic signal junction as an independent agent. An agent decides the signal duration of its phases in a round-robin (RR) manner using multi-agent Q-learning with either ε-greedy or UCB [3] based exploration strategies. It updates its Q-factors based on the cost feedback signal received from its neighbouring agents. This feedback signal can be easily constructed and is shown to be effective in minimizing the average delay of the vehicles in the network. We show through simulations over VISSIM that our algorithms perform significantly better than both the standard fixed signal timing (FST) algorithm and the saturation balancing (SAT) algorithm [15] over two real road networks.

Keywords :

Markov processes; decision theory; learning (artificial intelligence); multi-agent systems; optimal control; road traffic control; traffic engineering computing; ε-greedy based exploration strategy; MARL algorithms; MDP; RR manner; TSC problem; UCB based exploration strategy; average delay minimization; average delay reduction; cost feedback signal; discounted cost Markov decision process; dynamic TSC policies; independent agent; multiagent Q-learning; multiagent reinforcement learning algorithms; neighbouring agents; optimal control; round-robin manner; traffic lights; traffic signal control; traffic signal junction; Delays; Heuristic algorithms; Junctions; Q-factor; Roads; Vehicles; Q-learning; UCB; VISSIM; multi-agent reinforcement learning; traffic signal control;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Transportation Systems (ITSC), 2014 IEEE 17th International Conference on

Conference_Location :

Qingdao

Type :

conf

DOI :

10.1109/ITSC.2014.6958095

Filename :

6958095

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=154890