• DocumentCode
    2756966
  • Title

    Estimating the sentiment of social media content for security informatics applications

  • Author

    Glass, Kristin ; Colbaugh, Richard

  • Author_Institution
    New Mexico Inst. of Min. & Technol., Socorro, NM, USA
  • fYear
    2011
  • fDate
    10-12 July 2011
  • Firstpage
    65
  • Lastpage
    70
  • Abstract
    Inferring the sentiment of social media content, for instance blog posts and forum threads, is both of great interest to security analysts and technically challenging to accomplish. This paper presents two computational methods for estimating social media sentiment which address the challenges associated with Web-based analysis. Each method formulates the task as one of text classification, models the data as a bipartite graph of documents and words, and assumes that only limited prior information is available regarding the sentiment orientation of any of the documents or words of interest. The first algorithm is a semi-supervised sentiment classifier which combines knowledge of the sentiment labels for a few documents and words with information present in unlabeled data, which is abundant online. The second algorithm assumes existence of a set of labeled documents in a domain related to the domain of interest, and leverages these data to estimate sentiment in the target domain. We demonstrate the utility of the proposed methods by showing they outperform several standard techniques for the task of inferring the sentiment of online movie and consumer product reviews. Additionally, we illustrate the potential of the methods for security informatics by estimating regional public opinion regarding Egypt´s unfolding revolution through analysis of Arabic, Indonesian, and Danish (language) blog posts.
  • Keywords
    Internet; graph theory; pattern classification; security of data; social networking (online); text analysis; Web-based analysis; bipartite graph; blog posts; consumer product reviews; forum thread; online movie reviews; regional public opinion estimation; security analysts; security informatics application; semisupervised sentiment classifier; social media content sentiment estimation; text classification; Blogs; Glass; Media; Monitoring; Niobium; Security; USA Councils; machine learning; security informatics; sentiment analysis; social media;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligence and Security Informatics (ISI), 2011 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4577-0082-8
  • Type

    conf

  • DOI
    10.1109/ISI.2011.5984052
  • Filename
    5984052