• DocumentCode
    390899
  • Title

    Empirical comparison of various reinforcement learning strategies for sequential targeted marketing

  • Author

    Abe, Naoki ; Pednault, Edwin ; Wang, Haixun ; Zadrozny, Bianca ; Fan, Wei ; Apte, Chid

  • Author_Institution
    Math. Sci. Dept., IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    3
  • Lastpage
    10
  • Abstract
    We empirically evaluate the performance of various reinforcement learning methods in applications to sequential targeted marketing. In particular we propose and evaluate a progression of reinforcement learning methods, ranging from the "direct" or "batch" methods to "indirect" or "simulation based" methods, and those that we call "semidirect" methods that fall between them. We conduct a number of controlled experiments to evaluate the performance of these competing methods. Our results indicate that while the indirect methods can perform better in a situation in which nearly perfect modeling is possible, under the more realistic situations in which the system\´s modeling parameters have restricted attention, the indirect methods\´ performance tend to degrade. We also show that semi-direct methods are effective in reducing the amount of computation necessary to attain a given level of performance, and often result in more profitable policies.
  • Keywords
    data mining; decision theory; learning (artificial intelligence); marketing; cost-sensitive learning; data mining; decision making; performance; reinforcement learning; sequential targeted marketing; targeted marketing; Costs; Data mining; Decision making; Degradation; History; Learning systems; Mirrors; Sampling methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7695-1754-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2002.1183879
  • Filename
    1183879