DocumentCode
2785152
Title
An Efficient Algorithm of Hot Events Detection in Text Streams
Author
Bai, Junliang ; Guo, Jun ; Chen, Guang ; Xu, Weiran ; Du, Gang
Author_Institution
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2010
fDate
10-12 Oct. 2010
Firstpage
321
Lastpage
326
Abstract
Hot events detection in text streams has drawn increasing attention in recent sequential data mining works. Different from traditional TDT task which find all the real events´ cluster, hot events detection only identify hot events concerned by public. This paper proposes a novel approach to identify those events based on burst terms, terms co-occurrence and generative probabilistic model. Experiments with huge text stream sets crawled from WWW suggest that our algorithm can work on-line and identify hot events effectively and efficiently.
Keywords
data mining; text analysis; burst terms; efficient algorithm; generative probabilistic model; hot events detection; sequential data mining; terms cooccurrence; text streams; Algorithm design and analysis; Clustering algorithms; Computational modeling; Earthquakes; Event detection; Helium; Web sites; Algorithm; Data Mining; Hot Events Detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2010 International Conference on
Conference_Location
Huangshan
Print_ISBN
978-1-4244-8434-8
Electronic_ISBN
978-0-7695-4235-5
Type
conf
DOI
10.1109/CyberC.2010.65
Filename
5617108
Link To Document