• DocumentCode
    262307
  • Title

    Multilingual Sentiment Classification on Large Textual Data

  • Author

    Polpinij, Jantima

  • Author_Institution
    Dept. of Comput. Sci., Mahasarakham Univ., Mahasarakham, Thailand
  • fYear
    2014
  • fDate
    3-5 Dec. 2014
  • Firstpage
    183
  • Lastpage
    188
  • Abstract
    At present, Big Data have been created lot of buzz in the technology world. Sentiment Analysis or opinion mining is one of the important applications of \´Big Data\´, where sentiment analysis is used for recognising voice or response of crowd for products, services. This concept describes the items in some detail and evaluate them as good/bad, preferred/not preferred. The results are very important for a company because customer feedback can yield extremely valuable insights about a company\´s customer. However, in a commercial website of product reviews, many customers can access to describe the items in some detail and evaluate them with different languages. Therefore, many companies will gather customer feedback in multiple languages. Definitely, feedback in multiple languages raises problems in analysing the material. As this, this paper proposes a solution to classify a product review dataset into two classes: positive and negative sentiments. The proposed methodology is called "Multilingual Sentiment Classification (MSC)". It consists of two main processing steps: lingual separation and sentiment classification. The first main processing step is to classify online product reviews into language classes. The second processing step is to classify each textual dataset into two classes: positive and negative sentiments. It is noted, we concentrate and experiment on bilingual texts (Thai and English).
  • Keywords
    Big Data; data mining; natural language processing; text analysis; Big Data; MSC; commercial Website; customer feedback; large textual data; lingual separation; multilingual sentiment classification; negative sentiments; opinion mining; positive sentiments; product reviews; sentiment analysis; Accuracy; Big data; Companies; Kernel; Large scale integration; Sentiment analysis; Support vector machines; Big Data; Bilingual Text; Multiple Language; Product Reviews; Sentiment Classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data and Cloud Computing (BdCloud), 2014 IEEE Fourth International Conference on
  • Conference_Location
    Sydney, NSW
  • Type

    conf

  • DOI
    10.1109/BDCloud.2014.15
  • Filename
    7034784