Title :
An Efficient Solution to Factor Drifting Problem in the pLSA Model
Author :
Zhang, Liang ; Li, Chaoran ; Xu, Yanfei ; Shi, Baile
Author_Institution :
Dept. of Comput. & Inf. Technol., Fudan Univ., Shanghai
Abstract :
Probabilistic latent semantic analysis (pLSA) is a powerful statistical technique to analyze relation between factors in dyadic data. Although various pLSA-based applications, ranging from information retrieval, information filtering, to text-mining and visualization, have been successfully conducted, they can not afford dynamic revising of model when one of the factors changes constantly. In this paper, we take the advantage of decoupling ability of pLSA thoroughly, and propose a more elegant approach based on maximum likelihood estimation to gain an incremental learning with the drift of a factor. We demonstrate our method in the context of collaborative filtering where single user interests change fast, but the community interests remain almost constant. Experiments against the MovieLens and EachMovie data sets reveal that the proposed method improves the recommending accuracy 10% further beyond the original pLSA at a less computation cost
Keywords :
data mining; information filtering; learning (artificial intelligence); maximum likelihood estimation; probability; EachMovie data sets; MovieLens; collaborative information filtering; data visualization; dyadic data; factor drifting problem; incremental learning; information retrieval; maximum likelihood estimation; probabilistic latent semantic analysis; statistical technique; text-mining; Chaos; Collaboration; Computational efficiency; Information analysis; Information filtering; Information filters; Information retrieval; Information technology; Maximum likelihood estimation; Pattern analysis;
Conference_Titel :
Computer and Information Technology, 2005. CIT 2005. The Fifth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
0-7695-2432-X
DOI :
10.1109/CIT.2005.70