Title of article :
Effective Sentiment Analysis of Twitter with Apache Spark
Author/Authors :
Sasikanth, K. V. K. Department of CSE - GITE, Rajahmundry, A.P, India , Samatha, K. Department of CSE - JNTUK, Kakinada, A.P, India , Deshai, N. Department of IT - SRKREC, Bhimavaram, A.P, India , Sekhar, B. V. D. S. Department of IT - SRKREC, Bhimavaram, A.P, India , Venkatramana, S. Department of IT - SRKREC, Bhimavaram, A.P, India
Abstract :
Today’s interconnected world generates a huge amount of digital data while millions of users share
their opinions and feelings on various topics through popular applications such as social media,
different micro blogging sites, and various review websites every day. Nowadays, applying sentiment
analysis to Twitter data is regarded as a considerable problem, particularly for various organizations
or companies who seek to know customers’ feelings and opinions about their products and services.
The nature, variety, and enormous size of the data make it considerably practical for several
applications ranging from choice and decision making to product assessment. Tweets are being used to
convey the sentiment of a tweeter on a specific topic. Those companies keep surveying millions of
tweets on some kinds of subjects to evaluate actual opinions and know the customers’ feelings. This
paper aims to significantly collect, recognize, filter, reduce, and analyze all such relevant opinions,
emotions, and feelings of people on different products or services which could be categorized into
positive, negative, or neutral because such categorization improves sales growth of a company's
products, films, etc. The Naïve Bayes classifier is the mainly utilized machine learning method for
mining feelings from a large quantity of data, like twitter and other popular social networks, due to its
higher accuracy rates. This study performs sentiment polarity analysis on Twitter data in a distributed
environment, known as Apache Spark.
Keywords :
Big Data , Machine Learning , SVM , Map Reduce Spark Framework , Naïve Bayes , Sentiment Analysis , Natural language processing
Journal title :
International Journal of Industrial Engineering and Production Research