مرکز منطقه ای اطلاع رساني علوم و فناوري - Pre-processing Boosting Twitter Sentiment Analysis?

DocumentCode :

3772349

Title :

Pre-processing Boosting Twitter Sentiment Analysis?

Author :

Zhao Jianqiang

Author_Institution :

Xi´an Jiaotong Univ., Xi´an, China

fYear :

2015

Firstpage :

748

Lastpage :

753

Abstract :

Twitter sentiment analysis offers organizations an ability to monitor public feeling towards the products and events related to them in real time. Most existing researches to identify Twitter sentiment are focused on the extraction of new sentiment features and apply pre-processing before features selection, although ignore the role of tweet pre-processing. In this paper, we discuss the effects of pre-processing on sentiment classification performance. We evaluated the effects of URL, stopword, repeated letters, negation, acronym and number on sentiment classification performance using two feature models and four classifiers on five Twitter datasets. The experiments show that the accuracy of sentiment classification rises after expanding acronym and replacing negation, although hardly change when removal URL, removal numbers and removal stopword are applied. The various pre-processing methods cause different influence on performance of classifiers for each dataset.

Keywords :

"Twitter","Sentiment analysis","Uniform resource locators","Support vector machines","Niobium","Radio frequency","Terminology"

Publisher :

ieee

Conference_Titel :

Smart City/SocialCom/SustainCom (SmartCity), 2015 IEEE International Conference on

Type :

conf

DOI :

10.1109/SmartCity.2015.158

Filename :

7463812

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3772349