Title : 
Empirical comparison of various reinforcement learning strategies for sequential targeted marketing
         
        
            Author : 
Abe, Naoki ; Pednault, Edwin ; Wang, Haixun ; Zadrozny, Bianca ; Fan, Wei ; Apte, Chid
         
        
            Author_Institution : 
Math. Sci. Dept., IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
         
        
        
        
        
        
            Abstract : 
We empirically evaluate the performance of various reinforcement learning methods in applications to sequential targeted marketing. In particular we propose and evaluate a progression of reinforcement learning methods, ranging from the "direct" or "batch" methods to "indirect" or "simulation based" methods, and those that we call "semidirect" methods that fall between them. We conduct a number of controlled experiments to evaluate the performance of these competing methods. Our results indicate that while the indirect methods can perform better in a situation in which nearly perfect modeling is possible, under the more realistic situations in which the system\´s modeling parameters have restricted attention, the indirect methods\´ performance tend to degrade. We also show that semi-direct methods are effective in reducing the amount of computation necessary to attain a given level of performance, and often result in more profitable policies.
         
        
            Keywords : 
data mining; decision theory; learning (artificial intelligence); marketing; cost-sensitive learning; data mining; decision making; performance; reinforcement learning; sequential targeted marketing; targeted marketing; Costs; Data mining; Decision making; Degradation; History; Learning systems; Mirrors; Sampling methods;
         
        
        
        
            Conference_Titel : 
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
         
        
            Print_ISBN : 
0-7695-1754-4
         
        
        
            DOI : 
10.1109/ICDM.2002.1183879