DocumentCode :
2365860
Title :
Reinforcement learning with average cost for adaptive control of traffic lights at intersections
Author :
Prashanth, L.A. ; Bhatnagar, Shalabh
Author_Institution :
Dept. of Comput. Sci. & Autom., Indian Inst. of Sci., Bangalore, India
fYear :
2011
fDate :
5-7 Oct. 2011
Firstpage :
1640
Lastpage :
1645
Abstract :
We propose for the first time two reinforcement learning algorithms with function approximation for average cost adaptive control of traffic lights. One of these algorithms is a version of Q-learning with function approximation while the other is a policy gradient actor-critic algorithm that incorporates multi-timescale stochastic approximation. We show performance comparisons on various network settings of these algorithms with a range of fixed timing algorithms, as well as a Q-learning algorithm with full state representation that we also implement. We observe that whereas (as expected) on a two-junction corridor, the full state representation algorithm shows the best results, this algorithm is not implementable on larger road networks. The algorithm PG-AC-TLC that we propose is seen to show the best overall performance.
Keywords :
adaptive control; approximation theory; function approximation; learning (artificial intelligence); road traffic; stochastic processes; traffic control; traffic engineering computing; Q-learning algorithm; adaptive control; average cost; full state representation algorithm; function approximation; multitimescale stochastic approximation; policy gradient actor-critic algorithm; reinforcement learning; traffic lights; two-junction corridor; Algorithm design and analysis; Approximation algorithms; Cost function; Function approximation; Junctions; Learning; Roads; Q-learning; Traffic signal control; policy gradient actor-critic; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Transportation Systems (ITSC), 2011 14th International IEEE Conference on
Conference_Location :
Washington, DC
ISSN :
2153-0009
Print_ISBN :
978-1-4577-2198-4
Type :
conf
DOI :
10.1109/ITSC.2011.6082823
Filename :
6082823
Link To Document :
بازگشت