• DocumentCode
    536348
  • Title

    Blogger clustering by utilizing link information

  • Author

    Lu, Lu ; Zhu, Fuxi

  • Author_Institution
    Sch. of Comput., Wuhan Univ., Wuhan, China
  • Volume
    2
  • fYear
    2010
  • fDate
    29-31 Oct. 2010
  • Firstpage
    267
  • Lastpage
    270
  • Abstract
    Blogs are a new form of internet phenomenon and a vast ever-increasing information resource, which are dated unedited, highly opinionated personal online commentary including all kinds of hyperlinks such as citation link, comment link, blogroll link. These links can be viewed as the blogger´s browse behavior, which reflects the user´s interest to a certain extent. So we construct a blogger-post matrix, link analysis is considered in calculation of the entry of the matrix. With usage of probability latent semantic analysis, the conditional probability of latent variable Z to post P is transformed the the conditional probability of latent variable Z to post B, then the transformed results are used in similarity calculation. The k-medoids algorithm is adopted to further improve clustering result. Experiment results have shown that this new algorithm is effective.
  • Keywords
    Internet; citation analysis; data mining; information resources; pattern clustering; Internet phenomenon; blogger browse behavior; blogger clustering; blogger post matrix; blogroll link; citation link; comment link; conditional probability; information resource; k-medoids algorithm; latent variable Z; link analysis; link information; opinionated personal online commentary; probability latent semantic analysis; Blogger cluster; K-medoids algorithm; PLSA Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computing and Intelligent Systems (ICIS), 2010 IEEE International Conference on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-1-4244-6582-8
  • Type

    conf

  • DOI
    10.1109/ICICISYS.2010.5658752
  • Filename
    5658752