DocumentCode
536348
Title
Blogger clustering by utilizing link information
Author
Lu, Lu ; Zhu, Fuxi
Author_Institution
Sch. of Comput., Wuhan Univ., Wuhan, China
Volume
2
fYear
2010
fDate
29-31 Oct. 2010
Firstpage
267
Lastpage
270
Abstract
Blogs are a new form of internet phenomenon and a vast ever-increasing information resource, which are dated unedited, highly opinionated personal online commentary including all kinds of hyperlinks such as citation link, comment link, blogroll link. These links can be viewed as the blogger´s browse behavior, which reflects the user´s interest to a certain extent. So we construct a blogger-post matrix, link analysis is considered in calculation of the entry of the matrix. With usage of probability latent semantic analysis, the conditional probability of latent variable Z to post P is transformed the the conditional probability of latent variable Z to post B, then the transformed results are used in similarity calculation. The k-medoids algorithm is adopted to further improve clustering result. Experiment results have shown that this new algorithm is effective.
Keywords
Internet; citation analysis; data mining; information resources; pattern clustering; Internet phenomenon; blogger browse behavior; blogger clustering; blogger post matrix; blogroll link; citation link; comment link; conditional probability; information resource; k-medoids algorithm; latent variable Z; link analysis; link information; opinionated personal online commentary; probability latent semantic analysis; Blogger cluster; K-medoids algorithm; PLSA Model;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Computing and Intelligent Systems (ICIS), 2010 IEEE International Conference on
Conference_Location
Xiamen
Print_ISBN
978-1-4244-6582-8
Type
conf
DOI
10.1109/ICICISYS.2010.5658752
Filename
5658752
Link To Document