DocumentCode
3022018
Title
A way to construct evolution model of scientific papers based on the seed document and OLDA models
Author
Shanzeng Qiao ; Aili Han
Author_Institution
Dept. of Comput. Sci., Shandong Univ., Weihai, China
fYear
2013
fDate
20-22 Dec. 2013
Firstpage
900
Lastpage
903
Abstract
Tracking the topic evolution in scientific papers is an important issue. This paper presents a topic evolution framework which is based on the seed document and Online Latent Dirichlet Allocation (OLDA) models. We define seed document and link the seed documents of current time slice into the documents of next slice to keep the continuity of topics in content. And then, we run the OLDA in each time slice and take the word-topic posterior probability in previous time slice as the word-topic prior probability in current time slice to keep the continuity of topics in timeline. The cosine measure is used to compute the topic similarity in the adjacent time slices to find the topic evolution. The experimental results show that the way can help find the topic evolution in continuous time.
Keywords
probability; text analysis; OLDA model; cosine measure; online latent Dirichlet allocation; scientific paper; seed document; topic evolution model; topic posterior probability; topic similarity; Abstracts; Analytical models; Computational modeling; Data models; Hidden Markov models; Mathematical model; Neural networks; OLDA; natural language processing; seed document; topic evolution; topic model;
fLanguage
English
Publisher
ieee
Conference_Titel
Mechatronic Sciences, Electric Engineering and Computer (MEC), Proceedings 2013 International Conference on
Conference_Location
Shengyang
Print_ISBN
978-1-4799-2564-3
Type
conf
DOI
10.1109/MEC.2013.6885187
Filename
6885187
Link To Document