DocumentCode :
3564291
Title :
Performance evaluation of HDFS in big data management
Author :
Dev, Dipayan ; Patgiri, Ripon
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Inst. of Technol., Silchar, India
fYear :
2014
Firstpage :
1
Lastpage :
7
Abstract :
Size of the data used in today´s enterprises has been growing at exponential rates from last few years. Simultaneously, the need to process and analyze the large volumes of data has also increased. To handle and for analysis of large datasets, an open-source implementation of Apache framework, Hadoop is used now-a-days. For managing and storing of all the resources across its cluster, Hadoop possesses a distributed file system called Hadoop Distributed File System(HDFS).HDFS is written completely in Java and is depicted in such a way that in can store Big Data more reliably, and can stream those at high processing time to the user applications. In recent days, Hadoop is used widely by popular organizations like Yahoo, Facebook and various online shopping market venders. On the other hand, Experiments on Data- Intensive computations are going on to parallelize the processing of data. None of them could actually achieve a desirable performance. Hadoop, with its Map-Reduce parallel data processing capability can achieve these goals efficiently [1]. This paper initially provides an overview of the HDFS in details. Later on, the paper reports the experimental work of Hadoop with the big data and suggests the various factors that affects the Hadoop cluster performance. Paper concludes with providing the different real field challenges of Hadoop in recent days and scope for future work.
Keywords :
Big Data; distributed databases; parallel processing; Apache framework; HDFS; Hadoop distributed file system; Java; MapReduce parallel data processing; big data management; data-intensive computations; performance evaluation; Benchmark testing; Reliability; DataNode; HDFS; JobTracker; Metadata; NameNode;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing and Applications (ICHPCA), 2014 International Conference on
Print_ISBN :
978-1-4799-5957-0
Type :
conf
DOI :
10.1109/ICHPCA.2014.7045330
Filename :
7045330
Link To Document :
بازگشت