• DocumentCode
    1797367
  • Title

    Microblog hot topic detection based on topic model using term correlation matrix

  • Author

    Hui-Fang Ma ; Yue-Xin Sun ; Mei-Hui-Zi Jia ; Zhi-Chang Zhang

  • Author_Institution
    Coll. of Comput. Sci. & Eng., Northwest Normal Univ., Lanzhou, China
  • Volume
    1
  • fYear
    2014
  • fDate
    13-16 July 2014
  • Firstpage
    126
  • Lastpage
    130
  • Abstract
    In order to face the challenges of feature sparsity of short text messages for microblog hot topic detection, in this paper, we first explore the relation between terms, and then build term correlation matrix which is much denser than term-document matrix. Symmetric non-negative matrix factorization (SNMF) on term correlation matrix is applied to obtain term-topic matrix. Finally, we formulated the topic learning problem as probabilistic Latent semantic analysis (pLSA) on term-topic matrix. Besides, this paper also presents the definition of heat and mechanism of sorting the topics. Experiments show that our method can effectively cluster topics and be applied to microblog hot topic detection.
  • Keywords
    Web sites; document handling; learning (artificial intelligence); matrix decomposition; probability; semantic networks; sorting; SNMF; cluster topics; feature sparsity; microblog hot topic detection; pLSA; probabilistic latent semantic analysis; short text message; symmetric nonnegative matrix factorization; term correlation matrix; term-document matrix; term-topic matrix; topic learning problem; topic model; topic sorting; Abstracts; Cancer; Fans; Heating; TV; Visualization; Hot topic detection; Probabilistic latent semantic analysis; Symmetric non-negative matrix factorization; Term correlation matrix;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2014 International Conference on
  • Conference_Location
    Lanzhou
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4799-4216-9
  • Type

    conf

  • DOI
    10.1109/ICMLC.2014.7009104
  • Filename
    7009104