Title :
On-line Reinforcement Learning Control for Urban Traffic Signals
Author :
Zhi-yong, Liu ; Feng-wei, Ma
Author_Institution :
Wuyi Univ., Jiangmen
Abstract :
It is quit difficult to archive perfect effects by applying the traditional modeling and control methods to the urban traffic signal control system because of non-linearity, fuzzyness, self-organization and uncertainty in the system. The artificial intelligence technologies may offer a new way to resolve this problem. In allusion to characteristics of the traffic signal control system, this paper proposes an on-line control algorithm based on Dyna-Q reinforcement learning, and utilizes the experiential knowledge gained by the traffic signal control agent in the trial-error process to estimate the model, and then plans the actions in the estimated model, accordingly it can accelerate the iterative process of the Q-learning. This paper adapts TSIS(a microscopic traffic analysis software) to implement the simulation on two traffic trunk roads which consist of 10 intersections. Comparing with fixed-time control, genetic algorithm and Q-learning control algorithm, simulation results indicate that Dyna-Q reinforcement learning algorithm has an obvious superiority.
Keywords :
control nonlinearities; fuzzy control; genetic algorithms; iterative methods; learning (artificial intelligence); self-adjusting systems; traffic control; uncertain systems; Dyna-Q reinforcement learning; Q-learning control algorithm; artificial intelligence technology; fixed-time control; fuzzyness; genetic algorithm; iterative process; microscopic traffic analysis software; nonlinearity; online control algorithm; online reinforcement learning control; self-organization; traffic signal control agent; traffic trunk roads; trial-error process; uncertainty; urban traffic signal control system; Accelerated aging; Artificial intelligence; Control system synthesis; Iterative algorithms; Learning; Microscopy; Signal processing; Signal resolution; Traffic control; Uncertainty; Agent; Dyna-Q algorithm; Reinforcement learning; Urban trunk road coordination control;
Conference_Titel :
Control Conference, 2007. CCC 2007. Chinese
Conference_Location :
Hunan
Print_ISBN :
978-7-81124-055-9
Electronic_ISBN :
978-7-900719-22-5
DOI :
10.1109/CHICC.2006.4347023