DocumentCode :
3725605
Title :
Comparative analysis of effect of stopwords removal on sentiment classification
Author :
Kranti Vithal Ghag;Ketan Shah
Author_Institution :
Information Technology Department, MET´s SAKEC, Mumbai University, Mumbai, India
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
Classification refers to the computational techniques for classifying whether the sentiments of text are positive or negative. Sentiment Classification being a specialized domain of text mining is expected to benefit after preprocessing such as removing stopwords. Stopwords are frequently occurring words that hardly carry any information and orientation. In this paper the effect of stopwords removal on various sentiment classification models was analyzed. Sentiment Classification models were evaluated using the movie document dataset. Accuracy increased from unprocessed dataset to stopwords removed dataset for Traditional Sentiment Classifiers. Our classifiers had hardly any impact of stopwords removal which indicates that they handled stopwords at the time of classification itself. Our classifiers also displayed accuracy better than traditional classifier and another surveyed classifier based on term weighting technique.
Keywords :
"Support vector machines","Text mining","Computational modeling","Classification algorithms","Conferences","Computers","Feature extraction"
Publisher :
ieee
Conference_Titel :
Computer, Communication and Control (IC4), 2015 International Conference on
Type :
conf
DOI :
10.1109/IC4.2015.7375527
Filename :
7375527
Link To Document :
بازگشت