DocumentCode
736372
Title
An optimal approach for social data analysis in Big Data
Author
Kamala, V.R. ; Mary Gladence, L.
Author_Institution
Department of Information Technology, Sathyabama University, Chennai, India
fYear
2015
fDate
22-23 April 2015
Abstract
The term Big Data refers to huge, complex, and heterogeneous data. Many algorithms have been proposed based on the HACE characteristics of Big Data: Heterogeneous, Autonomous, Complex, and Evolving associations. Hadoop is an open-source framework used extensively for distributed storage and processing. The Hadoop framework provides parallel, distributed data processing, which increases overall computational power and improves processing time. However, choosing the right component for a given requirement is an important task, because it helps optimize the overall performance of the data analysis irrespective of data volume. Here we describe the components of the Hadoop technology stack and their optimal usage for analyzing various data sources, especially social data.
Keywords
Big data; Data mining; Java; Knowledge engineering; Reliability; Sparks; Big Data; HACE; Hadoop; Hive; Spark;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computation of Power, Energy Information and Communication (ICCPEIC), 2015 International Conference on
Conference_Location
Melmaruvathur, Chennai, India
Print_ISBN
978-1-4673-6524-6
Type
conf
DOI
10.1109/ICCPEIC.2015.7259464
Filename
7259464