• DocumentCode
    1306136
  • Title

    Urban traffic signal control using reinforcement learning agents

  • Author

    Balaji, P.G. ; German, X. ; Srinivasan, Dipti

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore, Singapore, Singapore
  • Volume
    4
  • Issue
    3
  • fYear
    2010
  • fDate
    9/1/2010 12:00:00 AM
  • Firstpage
    177
  • Lastpage
    188
  • Abstract
    This study presents a distributed multi-agent-based traffic signal control for optimising green timing in an urban arterial road network to reduce the total travel time and delay experienced by vehicles. The proposed multi-agent architecture uses traffic data collected by sensors at each intersection, stored historical traffic patterns and data communicated from agents in adjacent intersections to compute green time for a phase. The parameters like weights, threshold values used in computing the green time is fine tuned by online reinforcement learning with an objective to reduce overall delay. PARAMICS software was used as a platform to simulate 29 signalised intersection at Central Business District of Singapore and test the performance of proposed multi-agent traffic signal control for different traffic scenarios. The proposed multi-agent reinforcement learning (RLA) signal control showed significant improvement in mean time delay and speed in comparison to other traffic control system like hierarchical multi-agent system (HMS), cooperative ensemble (CE) and actuated control.
  • Keywords
    control engineering computing; delays; learning (artificial intelligence); multi-agent systems; road traffic; sensors; traffic engineering computing; PARAMICS software; actuated control; cooperative ensemble; distributed multiagent; intersection sensors; mean time delay; reinforcement learning; speed; urban arterial road network; urban traffic signal control;
  • fLanguage
    English
  • Journal_Title
    Intelligent Transport Systems, IET
  • Publisher
    iet
  • ISSN
    1751-956X
  • Type

    jour

  • DOI
    10.1049/iet-its.2009.0096
  • Filename
    5558886