Title :
Determine the Hardware Choice to Improve HDFS Performance Deployed in a Commodity Cluster
Author :
Youwei Wang ; Ge Fu ; Weiping Wang ; Xinran Liu ; Can Ma ; Dan Meng
Author_Institution :
Comput. Applic. Res. Center, Inst. of Comput. Technol., Beijing, China
Abstract :
The importance of storing and processing data eficiently is intensively highlighted in modern information technology infrastructures. Hadoop Distributed File System (HDFS) acts as the primary storage in modern cloud service environments and has been widely adopted for its portability and fault-tolerance. Current deployment of HDFS which runs on top of commodity hardware is unable to deliver desirable performance in terms of both latency and throughput. For data-intensive applications, I / O pressure becomes more exacerbated as the amount of data being stored and replicated to HDFS increases. In order to process extremely huge volume of data, investing in high-end hardware is one available practice. The primary contribution of this paper is to determine the I/O bottleneck for HDFS using both hardware and software approach and hence suggest corresponding solutions. Benchmarks and productivity tools are used to evaluate the proposed measure of improvement. The final conclusion about the crucial factor of the HDFS I/O performance is drawn based on experimental results.
Keywords :
benchmark testing; cloud computing; productivity; software fault tolerance; software performance evaluation; HDFS I/O performance; HDFS performance improvement; Hadoop distributed file system; I/O bottleneck; I/O pressure; benchmark tools; cloud service environments; commodity cluster; commodity hardware; data processing; data storing; data-intensive applications; fault-tolerance; hardware approach; high-end hardware; information technology infrastructures; productivity tools; software approach; Benchmark testing; Distributed databases; Hardware; Performance evaluation; Random access memory; Runtime; Throughput; Distributed file system; InfiniBand; Memory Storage; Performance;
Conference_Titel :
Computational Science and Engineering (CSE), 2013 IEEE 16th International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/CSE.2013.192