DocumentCode
262307
Title
Multilingual Sentiment Classification on Large Textual Data
Author
Polpinij, Jantima
Author_Institution
Dept. of Comput. Sci., Mahasarakham Univ., Mahasarakham, Thailand
fYear
2014
fDate
3-5 Dec. 2014
Firstpage
183
Lastpage
188
Abstract
At present, Big Data has created a lot of buzz in the technology world. Sentiment analysis, or opinion mining, is one of the important applications of 'Big Data', where it is used to recognise the voice or response of the crowd to products and services. Such opinions describe items in some detail and evaluate them as good/bad or preferred/not preferred. The results are very important for a company because customer feedback can yield extremely valuable insights about the company's customers. However, on a commercial product review website, many customers describe and evaluate items in different languages. Therefore, many companies gather customer feedback in multiple languages, and this raises problems in analysing the material. This paper therefore proposes a solution for classifying a product review dataset into two classes: positive and negative sentiments. The proposed methodology is called "Multilingual Sentiment Classification (MSC)". It consists of two main processing steps: lingual separation and sentiment classification. The first step classifies online product reviews into language classes. The second step classifies each resulting textual dataset into two classes: positive and negative sentiments. Note that we concentrate and experiment on bilingual texts (Thai and English).
Keywords
Big Data; data mining; natural language processing; text analysis; Big Data; MSC; commercial Website; customer feedback; large textual data; lingual separation; multilingual sentiment classification; negative sentiments; opinion mining; positive sentiments; product reviews; sentiment analysis; Accuracy; Big data; Companies; Kernel; Large scale integration; Sentiment analysis; Support vector machines; Big Data; Bilingual Text; Multiple Language; Product Reviews; Sentiment Classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data and Cloud Computing (BdCloud), 2014 IEEE Fourth International Conference on
Conference_Location
Sydney, NSW
Type
conf
DOI
10.1109/BDCloud.2014.15
Filename
7034784