• DocumentCode
    2785152
  • Title

    An Efficient Algorithm of Hot Events Detection in Text Streams

  • Author

    Bai, Junliang ; Guo, Jun ; Chen, Guang ; Xu, Weiran ; Du, Gang

  • Author_Institution
    Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2010
  • fDate
    10-12 Oct. 2010
  • Firstpage
    321
  • Lastpage
    326
  • Abstract
    Hot events detection in text streams has drawn increasing attention in recent sequential data mining works. Different from traditional TDT task which find all the real events´ cluster, hot events detection only identify hot events concerned by public. This paper proposes a novel approach to identify those events based on burst terms, terms co-occurrence and generative probabilistic model. Experiments with huge text stream sets crawled from WWW suggest that our algorithm can work on-line and identify hot events effectively and efficiently.
  • Keywords
    data mining; text analysis; burst terms; efficient algorithm; generative probabilistic model; hot events detection; sequential data mining; terms cooccurrence; text streams; Algorithm design and analysis; Clustering algorithms; Computational modeling; Earthquakes; Event detection; Helium; Web sites; Algorithm; Data Mining; Hot Events Detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2010 International Conference on
  • Conference_Location
    Huangshan
  • Print_ISBN
    978-1-4244-8434-8
  • Electronic_ISBN
    978-0-7695-4235-5
  • Type

    conf

  • DOI
    10.1109/CyberC.2010.65
  • Filename
    5617108