• DocumentCode
    3764772
  • Title

    Analyzing BigData with Hadoop cluster in HDInsight azure Cloud

  • Author

    Aditya Bhardwaj;Vineet Kumar Singh; Vanraj;Yogendra Narayan

  • Author_Institution
    CSE Dept., National Institute of Technical Teachers Training and Research, Chandigarh, India
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Recently Cloud based Hadoop has gained a lot of interest that offer ready to use Hadoop cluster environment for processing of Big Data, eliminating the operational challenges of on-site hardware investment, IT support, and installing, configuring of Hadoop components such as HDFS and MapReduce. On demand Hadoop as a service helps the industries to focus on business growth and based on pay per use model for Big Data processing with auto-scaling of Hadoop cluster feature. In this paper implementation of various MapReduce jobs like Pi, TeraSort, WordCount has been done on cloud based Hadoop deployment by using Microsoft Azure cloud services. Performance of MapReduce jobs has been evaluated with respect to CPU execution time with varying size of Hadoop cluster. From the experimental result, it is found that CPU execution time to finish the jobs decrease as the number of Data Nodes in HDInsight cluster increases and indicates the good response time with increase in performance as well as more customer satisfaction.
  • Keywords
    "Big data","Cloud computing","Hardware","Servers","Computer architecture","Business","Distributed databases"
  • Publisher
    ieee
  • Conference_Titel
    India Conference (INDICON), 2015 Annual IEEE
  • Electronic_ISBN
    2325-9418
  • Type

    conf

  • DOI
    10.1109/INDICON.2015.7443472
  • Filename
    7443472