Title :
Topic selection in latent dirichlet allocation
Author :
Biao Wang ; Yang Liu ; Zelong Liu ; Maozhen Li ; Man Qi
Author_Institution :
State Grid Sichuan Electr. Power Res. Inst., Chengdu, China
Abstract :
Latent Dirichlet Allocation (LDA) has been widely applied to text mining. LDA is a probabilistic topic model which processes documents as the probability distribution of topics. One challenging issue in application of LDA is to select the optimal number of topics in LDA model. This paper presents a topic selection method which considers the density of each topic and computes the most unstable topic structure through an iteration process. Evaluation results show that the proposed method can generate an optimal number of topics automatically with a small number of iterations.
Keywords :
data mining; iterative methods; statistical distributions; text analysis; LDA; documents processing; iteration process; latent Dirichlet allocation; probabilistic topic model; probability distribution; text mining; topic selection; topic structure; Clustering algorithms; Computational modeling; Data models; Educational institutions; Probabilistic logic; Resource management; Vectors; MapReduce; data locality; job scheduling;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4799-5147-5
DOI :
10.1109/FSKD.2014.6980931