Title :
Big data analysis using Apache Hadoop
Author :
Nandimath, Jyoti ; Banerjee, Ekata ; Patil, Abhijit ; Kakade, Pratima ; Vaidya, Salil ; Chaturvedi, Divyansh
Author_Institution :
Dept. of Comput. Eng., SKNCOE, Pune, India
Abstract :
The paradigm for processing huge datasets has shifted from centralized to distributed architectures. As enterprises began gathering large volumes of data, they found that it could not be processed by any existing centralized solution. Beyond time constraints, enterprises faced problems of efficiency, performance, and elevated infrastructure cost when processing data in a centralized environment. Distributed architectures enabled these large organizations to extract relevant information from huge data dumps. Apache Hadoop is one of the best open-source tools on the market for harnessing a distributed architecture to solve such data-processing problems. Using Apache Hadoop's components, such as data clusters, the MapReduce algorithm, and distributed processing, we resolve complex location-based data problems and return the relevant information to the system, thereby improving the user experience.
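The map-reduce pattern the abstract refers to can be sketched in plain Python. This is an illustrative example of the general map/shuffle/reduce flow, not the authors' implementation and not Hadoop's Java API; the location records and the counting task are hypothetical stand-ins for the paper's location-based data.

```python
from collections import defaultdict

def map_phase(records):
    # Emit (location, 1) pairs, as a Hadoop mapper would per input split.
    # The "location" field is a hypothetical example key.
    for record in records:
        yield record["location"], 1

def shuffle(pairs):
    # Group intermediate values by key; Hadoop performs this
    # shuffle-and-sort step between the map and reduce phases.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(groups):
    # Aggregate each key's values, as a Hadoop reducer would.
    return {key: sum(values) for key, values in groups.items()}

records = [
    {"location": "Pune"},
    {"location": "Pune"},
    {"location": "San Francisco"},
]
counts = reduce_phase(shuffle(map_phase(records)))
# counts == {"Pune": 2, "San Francisco": 1}
```

In an actual Hadoop cluster, the map and reduce functions run in parallel across data nodes, with HDFS supplying the input splits and the framework handling the shuffle; the single-process sketch above only shows the data flow.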
Keywords :
data analysis; distributed processing; public domain software; software architecture; Apache Hadoop; big data analysis; centralized architecture; complex data problems; data clusters; data processing; data processing problems; distributed architecture; open source software; open source tools; relevant information; time constraints; Computers; Data handling; Data processing; Data storage systems; Distributed databases; Information management; Big data; Data processing; Hadoop; Map Reduce;
Conference_Titel :
Information Reuse and Integration (IRI), 2013 IEEE 14th International Conference on
Conference_Location :
San Francisco, CA
DOI :
10.1109/IRI.2013.6642536