DocumentCode
1797367
Title
Microblog hot topic detection based on topic model using term correlation matrix
Author
Hui-Fang Ma ; Yue-Xin Sun ; Mei-Hui-Zi Jia ; Zhi-Chang Zhang
Author_Institution
Coll. of Comput. Sci. & Eng., Northwest Normal Univ., Lanzhou, China
Volume
1
fYear
2014
fDate
13-16 July 2014
Firstpage
126
Lastpage
130
Abstract
In order to face the challenges of feature sparsity of short text messages for microblog hot topic detection, in this paper, we first explore the relation between terms, and then build term correlation matrix which is much denser than term-document matrix. Symmetric non-negative matrix factorization (SNMF) on term correlation matrix is applied to obtain term-topic matrix. Finally, we formulated the topic learning problem as probabilistic Latent semantic analysis (pLSA) on term-topic matrix. Besides, this paper also presents the definition of heat and mechanism of sorting the topics. Experiments show that our method can effectively cluster topics and be applied to microblog hot topic detection.
Keywords
Web sites; document handling; learning (artificial intelligence); matrix decomposition; probability; semantic networks; sorting; SNMF; cluster topics; feature sparsity; microblog hot topic detection; pLSA; probabilistic latent semantic analysis; short text message; symmetric nonnegative matrix factorization; term correlation matrix; term-document matrix; term-topic matrix; topic learning problem; topic model; topic sorting; Abstracts; Cancer; Fans; Heating; TV; Visualization; Hot topic detection; Probabilistic latent semantic analysis; Symmetric non-negative matrix factorization; Term correlation matrix;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2014 International Conference on
Conference_Location
Lanzhou
ISSN
2160-133X
Print_ISBN
978-1-4799-4216-9
Type
conf
DOI
10.1109/ICMLC.2014.7009104
Filename
7009104
Link To Document