DocumentCode :
3165523
Title :
TopicSketch: Real-Time Bursty Topic Detection from Twitter
Author :
Wei Xie ; Feida Zhu ; Jing Jiang ; Ee-Peng Lim ; Ke Wang
Author_Institution :
Living Analytics Res. Centre, Singapore Manage. Univ., Singapore, Singapore
fYear :
2013
fDate :
7-10 Dec. 2013
Firstpage :
837
Lastpage :
846
Abstract :
Twitter has become one of the largest platforms for users around the world to share anything happening around them with friends and beyond. A bursty topic in Twitter is one that triggers a surge of relevant tweets within a short time, which often reflects important events of mass interest. How to leverage Twitter for early detection of bursty topics has therefore become an important research problem with immense practical value. Despite the wealth of research work on topic modeling and analysis in Twitter, it remains a huge challenge to detect bursty topics in real-time. As existing methods can hardly scale to handle the task with the tweet stream in real-time, we propose in this paper Topic Sketch, a novel sketch-based topic model together with a set of techniques to achieve real-time detection. We evaluate our solution on a tweet stream with over 30 million tweets. Our experiment results show both efficiency and effectiveness of our approach. Especially it is also demonstrated that Topic Sketch can potentially handle hundreds of millions tweets per day which is close to the total number of daily tweets in Twitter and present bursty event in finer-granularity.
Keywords :
information analysis; social networking (online); TopicSketch; Twitter; real-time bursty topic detection; sketch-based topic model; topic analysis; topic modeling; tweet stream; Acceleration; Equations; Monitoring; Optimization; Real-time systems; Surges; Twitter; TopicSketch; bursty topic; realtime; tweet stream;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
ISSN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2013.86
Filename :
6729568
Link To Document :
بازگشت