DocumentCode :
238913
Title :
Big data processing with harnessing hadoop - MapReduce for optimizing analytical workloads
Author :
Rama Satish, K.V. ; Kavya, N.P.
Author_Institution :
RNS Inst. of Technol., Bangalore, India
fYear :
2014
fDate :
27-29 Nov. 2014
Firstpage :
49
Lastpage :
54
Abstract :
Now a days, we are living with social media data like heartbeat. The exponential growth with data first presented challenges to cutting-edge businesses such as Google, MSN, Flipkart, Microsoft, Facebook, Twitter, LinkedIn etc. Nevertheless, existing big data analytical models for hadoop comply with MapReduce analytical workloads that process a small segment of the whole data set, thus failing to assess the capabilities of the MapReduce model under heavy workloads that process exponentially accumulative data sizes.[1] In all social business and technical research applications, there is a need to process big data of data in efficient manner on normal uses data. In this paper, we have proposed an efficient technique to classify the big data from e-mail using firefly and naïve bayes classifier. Proposed technique is comprised into two phase, (i) Map reduce framework for training and (ii) Map reduce framework for testing. Initially, the input twitter data is given to the process to select the suitable feature for data classification. The traditional firefly algorithm is applied and the optimized feature space is adopted for the best fitting results. Once the best feature space is identified through firefly algorithm, the data classification is done using the naïve bayes classifier. Here, these two processes are effectively distributed based on the concept given in Map-Reduce framework. The results of the experiment are validated using evaluation metrics namely, computation time, accuracy, specificity and sensitivity. For comparative analysis, proposed big data classification is compared with the existing works of naïve bayes and neural network.
Keywords :
Bayes methods; Big Data; neural nets; parallel processing; pattern classification; social networking (online); Facebook; Flipkart; Google; Hadoop; LinkedIn; MSN; MapReduce analytical workload; Microsoft; Twitter; analytical workloads; big data analytical model; big data processing; cutting-edge business; data classification; e-mail data; evaluation metrics; feature space; firefly algorithm; map-reduce framework; naïve Bayes classifier; naïve bayes classifier; neural network; social business; social media; technical research application; twitter data; Big data; Business; Classification algorithms; Computer architecture; Data models; Electronic mail; Testing; Big data feature selection; Map reduce framework; classification; firefly; naïve-bayes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Contemporary Computing and Informatics (IC3I), 2014 International Conference on
Conference_Location :
Mysore
Type :
conf
DOI :
10.1109/IC3I.2014.7019818
Filename :
7019818
Link To Document :
بازگشت