DocumentCode
189192
Title
Individual versus Difference Rewards on Reinforcement Learning for Route Choice
Author
Grunitzki, Ricardo ; De Oliveira Ramos, Gabriel ; Cetertich Bazzan, Ana Lucia
Author_Institution
Inst. de Inf./PPGC, UFRGS, Porto Alegre, Brazil
fYear
2014
fDate
18-22 Oct. 2014
Firstpage
253
Lastpage
258
Abstract
In transportation systems, drivers usually choose their routes based on their own knowledge about the network. Such a knowledge is obtained from drivers´ previous trips. When drivers are faced with jams they may change their routes to take a faster path. But this re-routing may not be a good choice because other drivers can proceed in the same way. Furthermore, such behaviour can create jams in other links. On the other hand, if drivers build their routes aiming at maximizing the overall travel time (system´s utility), rather than their individual travel time (agents´ utility), the whole system may benefit. This work presents two reinforcement learning algorithms for solving the route choice problem in road networks. The IQ-learning uses an individual reward function, which aims at finding a policy that maximizes the agents´ utility. On the other hand, DQ-learning algorithm shapes the agents´ reward based on difference rewards function, and aims at finding a route that maximizes the system´s utility. Through experiments we show that DQ-learning is able to reduce the overall travel time when compared to other methods.
Keywords
driver information systems; learning (artificial intelligence); road traffic; vehicle routing; DQ-learning algorithm; IQ-learning algorithm; difference reward function; individual reward function; reinforcement learning algorithms; road networks; route choice problem; transportation systems; Abstracts; Convergence; Heuristic algorithms; Learning (artificial intelligence); Roads; Vehicles; difference rewards; multiagent systems; reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems (BRACIS), 2014 Brazilian Conference on
Conference_Location
Sao Paulo
Type
conf
DOI
10.1109/BRACIS.2014.53
Filename
6984839
Link To Document