• DocumentCode
    578518
  • Title

    Mining twitterspace for information: Classifying sentiments programmatically using Java

  • Author

    Fiaidhi, Jinan ; Mohammed, Osama ; Mohammed, Sabah ; Fong, Simon ; Kim, Tai Hoon

  • Author_Institution
    Dept. of Comput. Sci., Lakehead Univ., Thunder Bay, ON, Canada
  • fYear
    2012
  • fDate
    22-24 Aug. 2012
  • Firstpage
    303
  • Lastpage
    308
  • Abstract
    People increasingly use Twitter to share advice, opinions, news, moods, concerns, facts, rumors, and everything else imaginable. Much of that data is public and available for mining. However, classifying automatically the sentiment of the Twitter messages into either positive or negative with respect to a query term represents a new research challenge. Variety of approaches that use natural language and statistical techniques failed to report high accuracy of tweets classification due to the nature of these tweets containing large number of abbreviations, emoticons and ill structured grammar. In this article we are presenting a programming approach that uses the Weka data mining APIs to classify tweets. Using this programming approach we can experiment on how to train the classifiers and determine which one is more effective than the others. In our experiments, the K* classifier is found to report a high degree of accuracy in tweets classification.
  • Keywords
    Java; application program interfaces; classification; data mining; query processing; social networking (online); Java; Twitter messages; Weka data mining API; emoticons; information mining; natural language; query term; sentiment classification; statistical techniques; structured grammar; tweets classification; twitterspace mining; Accuracy; Classification algorithms; Crawlers; Data mining; Earth; Programming; Twitter; classification algorithms; data mining; sentiment analysis; twitter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Information Management (ICDIM), 2012 Seventh International Conference on
  • Conference_Location
    Macau
  • ISSN
    pending
  • Print_ISBN
    978-1-4673-2428-1
  • Type

    conf

  • DOI
    10.1109/ICDIM.2012.6360089
  • Filename
    6360089