Title of article :
Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics
Author/Authors :
Khan, Atif Department of Computer Science - Islamia College Peshawar, Peshawar, Pakistan , Adnan Gul, Muhammad Department of Computer Science - Islamia College Peshawar, Peshawar, Pakistan , Irfan Uddin, M. Institute of Computing - Kohat University of Science and Technology, Kohat, Pakistan , Shah, Atif Ali Faculty of Engineering and Information Technology - Northern University, Nowshehra, Pakistan , Ahmad, Shafiq King Saud University - College of Engineering - Department of Industrial Engineering, Riyadh, Saudi Arabia , Al Firdausi, Muhammad Dzulqarnain King Saud University - College of Engineering - Department of Industrial Engineering, Riyadh, Saudi Arabia , Zaindin, Mazen King Saud University - College of Science - Department of Statistics and Operations Research, Riyadh, Saudi Arabia
Pages :
14
From page :
1
To page :
14
Abstract :
Information is exploding on the web at exponential pace, so online movie review is becoming a substantial information resource for online users. However, users post millions of movie reviews on regular basis, and it is not possible for users to summarize the reviews. Movie review classification and summarization is one of the challenging tasks in natural language processing. Therefore, an automatic approach is demanded to summarize the vast amount of movie reviews, and it will allow the users to speedily distinguish the positive and negative aspects of a movie. This study has proposed an approach for movie review classification and summarization. For movie review classification, bag-of-words feature extraction technique is used to extract unigrams, bigrams, and trigrams as a feature set from given review documents, and represent the review documents as a vector space model. Next, the Naïve Bayes algorithm is employed to classify the movie reviews (represented as a feature vector) into positive and negative reviews. For the task of movie review summarization, Word2vec feature extraction technique is used to extract features from classified movie review sentences, and then semantic clustering technique is used to cluster semantically related review sentences. Different text features are used to calculate the salience score of each review sentence in clusters. Finally, the top-ranked sentences are chosen based on highest salience scores to produce the extractive summary of movie reviews. Experimental results reveal that the proposed machine learning approach is superior than other state-of-the-art approaches.
Keywords :
Summarizing Online Movie , Machine Learning Approach , Big Data Analytics
Journal title :
Scientific Programming
Serial Year :
2020
Full Text URL :
Record number :
2610813
Link To Document :
بازگشت