DocumentCode
87803
Title
Mining user interest in microblogs with a user-topic model
Author
He Li ; Jia Yan ; Han Weihong ; Ding Zhaoyun
Author_Institution
Sch. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China
Volume
11
Issue
8
fYear
2014
fDate
Aug. 2014
Firstpage
131
Lastpage
144
Abstract
Microblogs have become an important platform for people to publish, transform information and acquire knowledge. This paper focuses on the problem of discovering user interest in microblogs. In this paper, we propose a topic mining model based on Latent Dirichlet Allocation (LDA) named user-topic model. For each user, the interests are divided into two parts by different ways to generate the microblogs: original interest and retweet interest. We represent a Gibbs sampling implementation for inference the parameters of our model, and discover not only user´s original interest, but also retweet interest. Then we combine original interest and retweet interest to compute interest words for users. Experiments on a dataset of Sina microblogs demonstrate that our model is able to discover user interest effectively and outperforms existing topic models in this task. And we find that original interest and retweet interest are similar and the topics of interest contain user labels. The interest words discovered by our model reflect user labels, but range is much broader.
Keywords
data mining; social networking (online); Gibbs sampling; LDA; Sina microblog dataset; latent Dirichlet allocation; original interest; retweet interest; topic mining model; user interest mining; user labels; user-topic model; Computational modeling; Data models; Educational institutions; Mathematical model; Probability distribution; Twitter; LDA; microblogs; topic mining; user interest; user-topic model;
fLanguage
English
Journal_Title
Communications, China
Publisher
ieee
ISSN
1673-5447
Type
jour
DOI
10.1109/CC.2014.6911095
Filename
6911095
Link To Document