مرکز منطقه ای اطلاع رساني علوم و فناوري - Reinforcement Learning With Function Approximation for Traffic Signal Control

DocumentCode :

1395479

Title :

Reinforcement Learning With Function Approximation for Traffic Signal Control

Author :

Prashanth, L.A. ; Bhatnagar, Shalabh

Author_Institution :

Dept. of Comput. Sci. & Autom., Indian Inst. of Sci., Bangalore, India

Volume :

Issue :

fYear :

2011

fDate :

6/1/2011 12:00:00 AM

Firstpage :

412

Lastpage :

421

Abstract :

We propose, for the first time, a reinforcement learning (RL) algorithm with function approximation for traffic signal control. Our algorithm incorporates state-action features and is easily implementable in high-dimensional settings. Prior work, e.g., the work of Abdulhai , on the application of RL to traffic signal control requires full-state representations and cannot be implemented, even in moderate-sized road networks, because the computational complexity exponentially grows in the numbers of lanes and junctions. We tackle this problem of the curse of dimensionality by effectively using feature-based state representations that use a broad characterization of the level of congestion as low, medium, or high. One advantage of our algorithm is that, unlike prior work based on RL, it does not require precise information on queue lengths and elapsed times at each lane but instead works with the aforementioned described features. The number of features that our algorithm requires is linear to the number of signaled lanes, thereby leading to several orders of magnitude reduction in the computational complexity. We perform implementations of our algorithm on various settings and show performance comparisons with other algorithms in the literature, including the works of Abdulhai and Cools , as well as the fixed-timing and the longest queue algorithms. For comparison, we also develop an RL algorithm that uses full-state representation and incorporates prioritization of traffic, unlike the work of Abdulhai We observe that our algorithm outperforms all the other algorithms on all the road network settings that we consider.

Keywords :

function approximation; learning (artificial intelligence); traffic control; traffic engineering computing; computational complexity; curse of dimensionality; feature-based state representations; full-state representation; function approximation; magnitude reduction; moderate-sized road networks; reinforcement learning; traffic signal control; Approximation algorithms; Equations; Function approximation; Junctions; Roads; Software algorithms; Timing; Q-learning with full-state representation (QTLC-FS); Q-learning with function approximation (QTLC-FA); reinforcement learning (RL); traffic signal control;

fLanguage :

English

Journal_Title :

Intelligent Transportation Systems, IEEE Transactions on

Publisher :

ieee

ISSN :

1524-9050

Type :

jour

DOI :

10.1109/TITS.2010.2091408

Filename :

5658157

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1395479