DocumentCode :
2369470
Title :
Probabilistic user behavior models
Author :
Manavoglu, Eren ; Pavlov, Dmitry ; Giles, C. Lee
Author_Institution :
Pennsylvania State Univ., University Park, PA, USA
fYear :
2003
fDate :
19-22 Nov. 2003
Firstpage :
203
Lastpage :
210
Abstract :
We present a mixture model based approach for learning individualized behavior models for the Web users. We investigate the use of maximum entropy and Markov mixture models for generating probabilistic behavior models. We first build a global behavior model for the entire population and then personalize this global model for the existing users by assigning each user individual component weights for the mixture model. We then use these individual weights to group the users into behavior model clusters. We show that the clusters generated in this manner are interpretable and able to represent dominant behavior patterns. We conduct offline experiments on around two months worth of data from CiteSeer, an online digital library for computer science research papers currently storing more than 470,000 documents. We show that both maximum entropy and Markov based personal user behavior models are strong predictive models. We also show that maximum entropy based mixture model outperforms Markov mixture models in recognizing complex user behavior patterns.
Keywords :
Internet; Markov processes; data mining; maximum entropy methods; statistical analysis; user modelling; CiteSeer data; Markov mixture models; Web users; global model; maximum entropy; online digital library; probabilistic user behavior models; Association rules; Collaboration; Computer science; Consumer behavior; Data mining; Entropy; Pattern analysis; Pattern recognition; Predictive models; Software libraries;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
Type :
conf
DOI :
10.1109/ICDM.2003.1250921
Filename :
1250921
Link To Document :
بازگشت