Title :
Toward Theme Development Analysis with Topic Clustering
Author :
Geng, Xueyu ; Wang, Jinlong
Author_Institution :
Sch. of Civil Eng., Qingdao Technol. Univ., Qingdao
Abstract :
Topic summarization and analysis is very important to understand an academic document collection and is very paramount for scientific research, which can help researchers find the hot field. Many scholars used the topic model to analyze the theme development, such as LDA. However, these methods need a pre-specified number of latent topics and manual topic labeling, which is usually difficult for people. Aiming to this problem, this paper proposes a method to analyze theme development with topic clustering. Different from the existing works, this paper uses the sliding window to cluster topics extracted in different time incrementally, the topic distance can be measured with KL-divergence. Some experiments on real data sets validate the effectiveness of our proposed method.
Keywords :
data mining; natural sciences computing; pattern clustering; text analysis; KL-divergence; academic document collection; scientific research; sliding window; theme development analysis; topic analysis; topic clustering; topic distance; topic model mining; topic summarization; Civil engineering; Data mining; Frequency; Information analysis; Labeling; Linear discriminant analysis; Polynomials; Text mining; Time measurement;
Conference_Titel :
Advanced Computer Theory and Engineering, 2008. ICACTE '08. International Conference on
Conference_Location :
Phuket
Print_ISBN :
978-0-7695-3489-3
DOI :
10.1109/ICACTE.2008.206