DocumentCode :
1877627
Title :
Addressing big data problem using Hadoop and Map Reduce
Author :
Patel, Akash B. ; Birla, M. ; Nair, Usha
fYear :
2012
fDate :
6-8 Dec. 2012
Firstpage :
1
Lastpage :
5
Abstract :
The size of the databases used in today´s enterprises has been growing at exponential rates day by day. Simultaneously, the need to process and analyze the large volumes of data for business decision making has also increased. In several business and scientific applications, there is a need to process terabytes of data in efficient manner on daily bases. This has contributed to the big data problem faced by the industry due to the inability of conventional database systems and software tools to manage or process the big data sets within tolerable time limits. Processing of data can include various operations depending on usage like culling, tagging, highlighting, indexing, searching, faceting, etc operations. It is not possible for single or few machines to store or process this huge amount of data in a finite time period. This paper reports the experimental work on big data problem and its optimal solution using Hadoop cluster, Hadoop Distributed File System (HDFS) for storage and using parallel processing to process large data sets using Map Reduce programming framework. We have done prototype implementation of Hadoop cluster, HDFS storage and Map Reduce framework for processing large data sets by considering prototype of big data application scenarios. The results obtained from various experiments indicate favorable results of above approach to address big data problem.
Keywords :
business data processing; data handling; database management systems; decision making; distributed processing; file organisation; HDFS; Map Reduce programming framework; addressing big data problem; business decision making; conventional database systems; distributed processing; exponential rates; hadoop cluster; hadoop distributed file system; prototype implementation; software tools; Big Data Problem; Hadoop Distributed File System; Hadoop cluster; Map Reduce; Parallel Processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering (NUiCONE), 2012 Nirma University International Conference on
Conference_Location :
Ahmedabad
Print_ISBN :
978-1-4673-1720-7
Type :
conf
DOI :
10.1109/NUICONE.2012.6493198
Filename :
6493198
Link To Document :
بازگشت