DocumentCode :
536348
Title :
Blogger clustering by utilizing link information
Author :
Lu, Lu ; Zhu, Fuxi
Author_Institution :
Sch. of Comput., Wuhan Univ., Wuhan, China
Volume :
2
fYear :
2010
fDate :
29-31 Oct. 2010
Firstpage :
267
Lastpage :
270
Abstract :
Blogs are a new form of internet phenomenon and a vast ever-increasing information resource, which are dated unedited, highly opinionated personal online commentary including all kinds of hyperlinks such as citation link, comment link, blogroll link. These links can be viewed as the blogger´s browse behavior, which reflects the user´s interest to a certain extent. So we construct a blogger-post matrix, link analysis is considered in calculation of the entry of the matrix. With usage of probability latent semantic analysis, the conditional probability of latent variable Z to post P is transformed the the conditional probability of latent variable Z to post B, then the transformed results are used in similarity calculation. The k-medoids algorithm is adopted to further improve clustering result. Experiment results have shown that this new algorithm is effective.
Keywords :
Internet; citation analysis; data mining; information resources; pattern clustering; Internet phenomenon; blogger browse behavior; blogger clustering; blogger post matrix; blogroll link; citation link; comment link; conditional probability; information resource; k-medoids algorithm; latent variable Z; link analysis; link information; opinionated personal online commentary; probability latent semantic analysis; Blogger cluster; K-medoids algorithm; PLSA Model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Computing and Intelligent Systems (ICIS), 2010 IEEE International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4244-6582-8
Type :
conf
DOI :
10.1109/ICICISYS.2010.5658752
Filename :
5658752
Link To Document :
بازگشت