• DocumentCode
    44795
  • Title

    Interpreting the Public Sentiment Variations on Twitter

  • Author

    Shulong Tan ; Yang Li ; Huan Sun ; Ziyu Guan ; Xifeng Yan ; Jiajun Bu ; Chun Chen ; Xiaofei He

  • Author_Institution
    Zhejiang Key Lab. of Service Robot, Zhejiang Univ., Hangzhou, China
  • Volume
    26
  • Issue
    5
  • fYear
    2014
  • fDate
    May-14
  • Firstpage
    1158
  • Lastpage
    1170
  • Abstract
    Millions of users share their opinions on Twitter, making it a valuable platform for tracking and analyzing public sentiment. Such tracking and analysis can provide critical information for decision making in various domains. Therefore it has attracted attention in both academia and industry. Previous research mainly focused on modeling and tracking public sentiment. In this work, we move one step further to interpret sentiment variations. We observed that emerging topics (named foreground topics) within the sentiment variation periods are highly related to the genuine reasons behind the variations. Based on this observation, we propose a Latent Dirichlet Allocation (LDA) based model, Foreground and Background LDA (FB-LDA), to distill foreground topics and filter out longstanding background topics. These foreground topics can give potential interpretations of the sentiment variations. To further enhance the readability of the mined reasons, we select the most representative tweets for foreground topics and develop another generative model called Reason Candidate and Background LDA (RCB-LDA) to rank them with respect to their “popularity” within the variation period. Experimental results show that our methods can effectively find foreground topics and rank reason candidates. The proposed models can also be applied to other tasks such as finding topic differences between two sets of documents.
  • Keywords
    data mining; decision making; information filtering; social networking (online); text analysis; FB-LDA; RCB-LDA; Twitter; data mining; decision making; documents; foreground topics; foreground-background LDA; generative model; latent Dirichlet allocation based model; longstanding background topics filtering; public sentiment analysis; public sentiment modeling; public sentiment tracking; public sentiment variation interpretation; reason candidate; representative tweets; Analytical models; Decision making; Educational institutions; Indexes; Resource management; Tracking; Twitter; Gibbs sampling; Text mining; Twitter; Web mining; emerging topic mining; latent Dirichlet allocation; public sentiment; sentiment analysis;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2013.116
  • Filename
    6560079