Title :
Interactive mining topic evolutionary patterns from internet forums
Author :
Zhou, Bin ; Kai, Cui ; Jia, Yan ; Li, Jing
Author_Institution :
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
In many real-world topic detection tasks, the process of the topic detection is often interactive, which means the users are likely to interfere the reason process by expressing their preferences. We proposed an algorithm, iOLDA, and the software framework for interactive topic evolution pattern detection based on Latent Dirichlet Allocation (LDA). To abate those topics not interested or related, it allows the users to add supervised information by adjusting the posterior topic-word distributions at the end of each iteration, which may influence the inference process of the next iteration. Experiments are conducted both on English and Chinese corpus and the results show that the extracted topics capture meaningful themes in the data, and the proposed interaction policies can help to discover better topics.
Keywords :
Internet; data mining; iterative methods; natural language processing; user interfaces; Chinese corpus; English corpus; Internet; Latent Dirichlet Allocation; iOLDA; interactive mining; interactive topic evolution pattern; posterior topic-word distribution; topic detection; Clustering algorithms; Communication effectiveness; Computer science education; Data mining; Discussion forums; Educational technology; Electronic mail; Inference algorithms; Linear discriminant analysis; Software algorithms; data mining; probabilistic topic models; topic evolutionary patterns;
Conference_Titel :
Education Technology and Computer (ICETC), 2010 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6367-1
DOI :
10.1109/ICETC.2010.5530005