DocumentCode
390899
Title
Empirical comparison of various reinforcement learning strategies for sequential targeted marketing
Author
Abe, Naoki ; Pednault, Edwin ; Wang, Haixun ; Zadrozny, Bianca ; Fan, Wei ; Apte, Chid
Author_Institution
Math. Sci. Dept., IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear
2002
fDate
2002
Firstpage
3
Lastpage
10
Abstract
We empirically evaluate the performance of various reinforcement learning methods in applications to sequential targeted marketing. In particular we propose and evaluate a progression of reinforcement learning methods, ranging from the "direct" or "batch" methods to "indirect" or "simulation based" methods, and those that we call "semidirect" methods that fall between them. We conduct a number of controlled experiments to evaluate the performance of these competing methods. Our results indicate that while the indirect methods can perform better in a situation in which nearly perfect modeling is possible, under the more realistic situations in which the system\´s modeling parameters have restricted attention, the indirect methods\´ performance tend to degrade. We also show that semi-direct methods are effective in reducing the amount of computation necessary to attain a given level of performance, and often result in more profitable policies.
Keywords
data mining; decision theory; learning (artificial intelligence); marketing; cost-sensitive learning; data mining; decision making; performance; reinforcement learning; sequential targeted marketing; targeted marketing; Costs; Data mining; Decision making; Degradation; History; Learning systems; Mirrors; Sampling methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN
0-7695-1754-4
Type
conf
DOI
10.1109/ICDM.2002.1183879
Filename
1183879
Link To Document