DocumentCode :
1797367
Title :
Microblog hot topic detection based on topic model using term correlation matrix
Author :
Hui-Fang Ma ; Yue-Xin Sun ; Mei-Hui-Zi Jia ; Zhi-Chang Zhang
Author_Institution :
Coll. of Comput. Sci. & Eng., Northwest Normal Univ., Lanzhou, China
Volume :
1
fYear :
2014
fDate :
13-16 July 2014
Firstpage :
126
Lastpage :
130
Abstract :
In order to face the challenges of feature sparsity of short text messages for microblog hot topic detection, in this paper, we first explore the relation between terms, and then build term correlation matrix which is much denser than term-document matrix. Symmetric non-negative matrix factorization (SNMF) on term correlation matrix is applied to obtain term-topic matrix. Finally, we formulated the topic learning problem as probabilistic Latent semantic analysis (pLSA) on term-topic matrix. Besides, this paper also presents the definition of heat and mechanism of sorting the topics. Experiments show that our method can effectively cluster topics and be applied to microblog hot topic detection.
Keywords :
Web sites; document handling; learning (artificial intelligence); matrix decomposition; probability; semantic networks; sorting; SNMF; cluster topics; feature sparsity; microblog hot topic detection; pLSA; probabilistic latent semantic analysis; short text message; symmetric nonnegative matrix factorization; term correlation matrix; term-document matrix; term-topic matrix; topic learning problem; topic model; topic sorting; Abstracts; Cancer; Fans; Heating; TV; Visualization; Hot topic detection; Probabilistic latent semantic analysis; Symmetric non-negative matrix factorization; Term correlation matrix;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2014 International Conference on
Conference_Location :
Lanzhou
ISSN :
2160-133X
Print_ISBN :
978-1-4799-4216-9
Type :
conf
DOI :
10.1109/ICMLC.2014.7009104
Filename :
7009104
Link To Document :
بازگشت