• DocumentCode
    736372
  • Title

    An optimal approach for social data analysis in Big Data

  • Author

    Kamala V.R ; MaryGladence, L.

  • Author_Institution
    Department ofInformation Technonology, Sathyabama University, Chennai, India
  • fYear
    2015
  • fDate
    22-23 April 2015
  • Abstract
    The term Big Data refers to huge, complex and heterogeneous data. Based on the HACE characteristics of Big Data, which is Heterogeneous, Autonomous, Complex and Evolving associations, there are many algorithms proposed. Hadoop is an open source framework used extensively for distributed storage and processing. Hadoop framework provides parallel distributive data processing standards which increases the overall computational power and processing time. But choosing the right component for our requirement is an important task. It helps in optimizing the overall performance of the data analysis irrespective of data volume. Here we describe the Hadoop technology stack and their optimal usage for analyzing various data sources, especially the social data.
  • Keywords
    Big data; Data mining; Java; Knowledge engineering; Reliability; Sparks; Big Data; HACE; Hadoop; Hive; Spark;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computation of Power, Energy Information and Commuincation (ICCPEIC), 2015 International Conference on
  • Conference_Location
    Melmaruvathur, Chennai, India
  • Print_ISBN
    978-1-4673-6524-6
  • Type

    conf

  • DOI
    10.1109/ICCPEIC.2015.7259464
  • Filename
    7259464