DocumentCode
3485368
Title
Application of reinforcement learning in dynamic pricing algorithms
Author
Jintian, Wang ; Lei, Zhou
Author_Institution
Dept. of Comput. Sci. & Technol., Hefei Univ. of Technol., Hefei, China
fYear
2009
fDate
5-7 Aug. 2009
Firstpage
419
Lastpage
423
Abstract
This paper is concerned with the dynamic pricing problems of a duopoly case in electronic retail markets. Combined with the concept of performance potential, the simulated annealing Q-learning (SA-Q) and the win-or-learn-fast policy hill climbing algorithm (WoLF-PHC) are used to solve the learning problems of multi-agent systems with either average- or discounted-reward criteria, under the case that only partial information about the opponent is known. The simulation results show that the WoLF-PHC algorithm performs well in adapting environment´s change and in deriving better learning values than the SA-Q algorithm.
Keywords
learning (artificial intelligence); multi-agent systems; pricing; retailing; simulated annealing; WoLF-PHC algorithm; average-reward criteria; discounted-reward criteria; duopoly; dynamic pricing algorithm; electronic retail market; multiagent system; performance potential; reinforcement learning; simulated annealing Q-learning; win-or-learn-fast policy hill climbing algorithm; Application software; Automation; Computational modeling; Computer science; Consumer electronics; Heuristic algorithms; Learning; Logistics; Pricing; Simulated annealing; WoLF-PHC; multi-agent; performance potential; simulated annealing Q-learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Automation and Logistics, 2009. ICAL '09. IEEE International Conference on
Conference_Location
Shenyang
Print_ISBN
978-1-4244-4794-7
Electronic_ISBN
978-1-4244-4795-4
Type
conf
DOI
10.1109/ICAL.2009.5262885
Filename
5262885
Link To Document