Addressing big data problem using Hadoop and Map Reduce

Author

Patel, Akash B. ; Birla, M. ; Nair, Usha

fYear

2012

fDate

6-8 Dec. 2012

Firstpage

1

Lastpage

5

Abstract

The size of the databases used in today´s enterprises has been growing at exponential rates day by day. Simultaneously, the need to process and analyze the large volumes of data for business decision making has also increased. In several business and scientific applications, there is a need to process terabytes of data in efficient manner on daily bases. This has contributed to the big data problem faced by the industry due to the inability of conventional database systems and software tools to manage or process the big data sets within tolerable time limits. Processing of data can include various operations depending on usage like culling, tagging, highlighting, indexing, searching, faceting, etc operations. It is not possible for single or few machines to store or process this huge amount of data in a finite time period. This paper reports the experimental work on big data problem and its optimal solution using Hadoop cluster, Hadoop Distributed File System (HDFS) for storage and using parallel processing to process large data sets using Map Reduce programming framework. We have done prototype implementation of Hadoop cluster, HDFS storage and Map Reduce framework for processing large data sets by considering prototype of big data application scenarios. The results obtained from various experiments indicate favorable results of above approach to address big data problem.

Keywords

business data processing; data handling; database management systems; decision making; distributed processing; file organisation; HDFS; Map Reduce programming framework; addressing big data problem; business decision making; conventional database systems; distributed processing; exponential rates; hadoop cluster; hadoop distributed file system; prototype implementation; software tools; Big Data Problem; Hadoop Distributed File System; Hadoop cluster; Map Reduce; Parallel Processing;

fLanguage

English

Publisher

ieee

Conference_Titel

Engineering (NUiCONE), 2012 Nirma University International Conference on

Conference_Location

Ahmedabad

Print_ISBN

978-1-4673-1720-7

Type

conf

DOI

10.1109/NUICONE.2012.6493198

Filename

6493198