DocumentCode :
188128
Title :
Rough Set Theory for Arabic Sentiment Classification
Author :
Al-Radaideh, Qasem A. ; Twaiq, Laila M.
Author_Institution :
Dept. of Comput. Inf. Syst., Yarmouk Univ., Irbid, Jordan
fYear :
2014
fDate :
27-29 Aug. 2014
Firstpage :
559
Lastpage :
564
Abstract :
Recently, the web has been a major place where people interact and express their views and sentiments. Researchers were attracted to conduct further analysis on this rich content known as Sentiment Analysis, Sentiment Classification or Opinion Mining. Rough Set Theory is a mathematical tool that can be used for classification and analysis of uncertain, incomplete or vague information. It can be used to significantly reduce the dimensionality of the data without much loss in information content, which is achieved using the concept of Reduct. This paper focuses on investigating the use of the Rough Set theory approach for Arabic Sentiment Classification. This paper compares some approaches that have been proposed to find Reducts to classify Arabic tweeting reviews. The Rosetta toolkit is used for testing where two main Reduct approaches were applied: Johnson Reducer and Genetic-based reducer. We compared the results of the approaches using cross validation evaluation method. The results showed that Genetic reducer achieved 57% of accuracy, which outperformed Johnson Reducer. The paper concludes that the Rough Set based approach is applicable for sentiment analysis of Arabic text but further investigation is required to evaluate other Reduct generation methods.
Keywords :
data mining; natural language processing; pattern classification; rough set theory; text analysis; Arabic sentiment classification; Arabic text analysis; Arabic tweeting review classification; Johnson Reducer; Reduct generation methods; Rosetta toolkit; cross validation evaluation method; genetic-based reducer; information content; mathematical tool; opinion mining; rough set theory approach; sentiment analysis; sentiment classification; vague information; Accuracy; Genetics; Sentiment analysis; Set theory; Support vector machines; Testing; Training; Arabic sentiment analysis; Reduct Generation; Rough Set Theory (RST);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Future Internet of Things and Cloud (FiCloud), 2014 International Conference on
Conference_Location :
Barcelona
Type :
conf
DOI :
10.1109/FiCloud.2014.97
Filename :
6984253
Link To Document :
بازگشت