Title :
Combining parametric and nonparametric topic model to discover microblog event
Author :
Shengbing Liu ; Li Liu ; Ruzhong Cheng
Author_Institution :
Sch. of Electron. & Comput. Eng., Peking Univ., Shenzhen, China
Abstract :
Compared with traditional news media, microblog holds overwhelming superiority in fast-diffusion and comprehensive coverage of topics. Microblog becomes an effective, particular and important carrier of affair information and many other text analysis tasks, e.g., event discovering based on microblog have special significance. Common tools of content analysis, such as topic model, however, experience severe data sparsity problems due to short length of microblog. Following previous researchers´ idea, such as separating personal interest post from global event post, we further differentiate general topics from event topics and adopt nonparametric method to model the birth and death of event. We conduct experiments on Twitter data set, and the experimental results demonstrate that our method can not only discover event effectively, but also mine higher quality general topics.
Keywords :
natural language processing; social networking (online); text analysis; Twitter data set; microblog event discovery; nonparametric topic model; parametric topic model; Coherence; Computational linguistics; Computational modeling; Data models; Educational institutions; Media; Twitter; Gibbs Sampling; LDA; Nonparametric Topic Model;
Conference_Titel :
Information Science, Electronics and Electrical Engineering (ISEEE), 2014 International Conference on
Conference_Location :
Sapporo
Print_ISBN :
978-1-4799-3196-5
DOI :
10.1109/InfoSEEE.2014.6946176