• DocumentCode
    142153
  • Title

    Combining parametric and nonparametric topic model to discover microblog event

  • Author

    Shengbing Liu ; Li Liu ; Ruzhong Cheng

  • Author_Institution
    Sch. of Electron. & Comput. Eng., Peking Univ., Shenzhen, China
  • Volume
    3
  • fYear
    2014
  • fDate
    26-28 April 2014
  • Firstpage
    1527
  • Lastpage
    1531
  • Abstract
    Compared with traditional news media, microblog holds overwhelming superiority in fast-diffusion and comprehensive coverage of topics. Microblog becomes an effective, particular and important carrier of affair information and many other text analysis tasks, e.g., event discovering based on microblog have special significance. Common tools of content analysis, such as topic model, however, experience severe data sparsity problems due to short length of microblog. Following previous researchers´ idea, such as separating personal interest post from global event post, we further differentiate general topics from event topics and adopt nonparametric method to model the birth and death of event. We conduct experiments on Twitter data set, and the experimental results demonstrate that our method can not only discover event effectively, but also mine higher quality general topics.
  • Keywords
    natural language processing; social networking (online); text analysis; Twitter data set; microblog event discovery; nonparametric topic model; parametric topic model; Coherence; Computational linguistics; Computational modeling; Data models; Educational institutions; Media; Twitter; Gibbs Sampling; LDA; Nonparametric Topic Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science, Electronics and Electrical Engineering (ISEEE), 2014 International Conference on
  • Conference_Location
    Sapporo
  • Print_ISBN
    978-1-4799-3196-5
  • Type

    conf

  • DOI
    10.1109/InfoSEEE.2014.6946176
  • Filename
    6946176