• DocumentCode
    3460511
  • Title
    Association Link Network Based Core Events Discovery on the Web
  • Author
    Yang Liu ; Borhan, Norhayati ; Xiangfeng Luo ; Hui Zhang ; Xiang He
  • Author_Institution
    Sch. of Comput. Eng. & Sci., Shanghai Univ., Shanghai, China
  • fYear
    2013
  • fDate
    3-5 Dec. 2013
  • Firstpage
    553
  • Lastpage
    560
  • Abstract
    As the number of documents grows explosively in the era of big data, document clustering has proven useful for organizing online document streams into events. However, existing studies on document clustering still suffer from problems of high dimensionality, scalability, and accuracy. In this paper, we present a novel association link network (ALN) based document clustering method, an adaptive iteration splitting process that discovers core events on the web. In each iteration, we first detect community structures in the ALN, then map documents to their associated communities based on word relations in the ALN, and finally rebuild the communities using the mapped documents. Experimental results on a real data set demonstrate the effectiveness of the presented method, compared with existing document clustering methods, in automatically discovering web events.
  • Keywords
    Big Data; Internet; document handling; Web events; World Wide Web; adaptive iteration splitting process; association link network; core events discovery; document clustering; map documents; mapped documents; online document streams; Accuracy; Clustering algorithms; Clustering methods; Communities; Detection algorithms; Semantics; Vectors; adaptive splitting; community detection; event discovery
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computational Science and Engineering (CSE), 2013 IEEE 16th International Conference on
  • Conference_Location
    Sydney, NSW
  • Type
    conf
  • DOI
    10.1109/CSE.2013.88
  • Filename
    6755268
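
The iteration outlined in the abstract (detect communities, map documents to them, rebuild from the mapped documents) can be illustrated with a minimal sketch. This is not the authors' implementation: a plain word co-occurrence graph stands in for the ALN, connected components over strong edges stand in for community detection, and every function name and parameter (`build_word_graph`, `detect_communities`, `map_documents`, `discover_events`, `min_weight`) is hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def build_word_graph(docs):
    """Weighted word co-occurrence graph (assumed stand-in for an ALN)."""
    weights = Counter()
    for words in docs:
        for a, b in combinations(sorted(set(words)), 2):
            weights[(a, b)] += 1
    return weights


def detect_communities(weights, min_weight):
    """Connected components over edges with weight >= min_weight
    (assumed stand-in for a real community detection algorithm)."""
    adj = defaultdict(set)
    for (a, b), w in weights.items():
        if w >= min_weight:
            adj[a].add(b)
            adj[b].add(a)
    seen, communities = set(), []
    for start in sorted(adj):
        if start in seen:
            continue
        stack, comp = [start], set()
        while stack:
            node = stack.pop()
            if node in comp:
                continue
            comp.add(node)
            stack.extend(adj[node] - comp)
        seen |= comp
        communities.append(comp)
    return communities


def map_documents(docs, communities):
    """Group documents by the community sharing the most words with them.
    Documents that match no community are collected under key -1."""
    groups = defaultdict(list)
    for words in docs:
        overlaps = [len(set(words) & c) for c in communities]
        best = max(overlaps, default=0)
        key = overlaps.index(best) if best > 0 else -1
        groups[key].append(words)
    return list(groups.values())


def discover_events(docs, min_weight=2):
    """Split a document set repeatedly until a group no longer divides."""
    if len(docs) < 2:
        return [docs]
    communities = detect_communities(build_word_graph(docs), min_weight)
    groups = map_documents(docs, communities)
    if len(groups) <= 1:
        return [docs]  # stable group: treat it as one event
    events = []
    for group in groups:
        # Rebuild the graph from the mapped documents only, then split again.
        events.extend(discover_events(group, min_weight))
    return events


docs = [
    ["quake", "japan", "tsunami"],
    ["quake", "japan", "rescue"],
    ["election", "vote", "poll"],
    ["election", "vote", "result"],
]
events = discover_events(docs)
```

With these four toy documents, `events` holds two groups, one per topic. The recursion terminates because each split produces strictly smaller groups; a real system would likely substitute a weighted association measure and a dedicated community detection algorithm for the two stand-ins.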