• DocumentCode
    578520
  • Title

    Opinion mining over twitterspace: Classifying tweets programmatically using the R approach

  • Author

    Fiaidhi, Jinan ; Mohammed, Osama ; Mohammed, Sabah ; Fong, Simon ; Kim, Tai Hoon

  • Author_Institution
    Dept. of Comput. Sci., Lakehead Univ., Thunder Bay, ON, Canada
  • fYear
    2012
  • fDate
    22-24 Aug. 2012
  • Firstpage
    313
  • Lastpage
    319
  • Abstract
    Today the channels for expressing opinions seem to increase daily. When these opinions are relevant to a company, they are important sources of business insight, whether they represent critical intelligence about a customer´s defection risk, the impact of an influential reviewer on other people´s purchase decisions, or early feedback on product releases, company news or competitors. Capturing and analyzing these opinions is a necessity for proactive product planning, marketing and customer service and it is also critical in maintaining brand integrity. The importance of harnessing opinion is growing as consumers use technologies such as Twitter to express their views directly to other consumers. Tracking the disparate sources of opinion is hard - but even harder is quickly and accurately extracting the meaning so companies can analyze and act. Tweets´ Language is complicated and contextual, especially when people are expressing opinions and requires reliable sentiment analysis based on parsing many linguistic shades of gray. This article argues that using the R programming platform for analyzing tweets programmatically simplifies the task of sentiment analysis and opinion mining. An R programming technique has been used for testing different sentiment lexicons as well as different scoring schemes. Experiments on analyzing the tweets of users over six NHL hockey teams reveals the effectively of using the opinion lexicon and the Latent Dirichlet Allocation (LDA) scoring scheme.
  • Keywords
    classification; consumer behaviour; customer services; purchasing; social networking (online); LDA scoring scheme; R approach; R programming platform; Tweet language; Twitterspace; brand integrity; business insight; classification; company news; critical intelligence; customer defection risk; customer service; latent Dirichlet allocation; marketing; opinion lexicon; opinion mining; parsing; proactive product planning; product feedback; purchase decision; sentiment analysis; sentiment lexicon; Companies; Data mining; Educational institutions; Internet; Natural language processing; Programming; Twitter; classification algorithms; data mining; sentiment analysis; twitter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Information Management (ICDIM), 2012 Seventh International Conference on
  • Conference_Location
    Macau
  • ISSN
    pending
  • Print_ISBN
    978-1-4673-2428-1
  • Type

    conf

  • DOI
    10.1109/ICDIM.2012.6360095
  • Filename
    6360095