DocumentCode
187074
Title
A Distributed NameNode Cluster for a Highly-Available Hadoop Distributed File System
Author
Yonghwan Kim ; Araragi, Tadashi ; Nakamura, Jun ; Masuzawa, Toru
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
fYear
2014
fDate
6-9 Oct. 2014
Firstpage
333
Lastpage
334
Abstract
Recently, Hadoop attracts much attention of engineers and researchers as an emerging and effective framework for Big Data. HDFS (Hadoop Distributed File System) can manage huge amount of data with high performance and reliability using only commodity hardware. However, HDFS requires a single master node, called a NameNode, to manage the entire namespace of the file system. This causes the SPOF (Single Point Of Failure) problem because the file system becomes inaccessible when the NameNode fails. This also causes a bottleneck of efficiency since all the access requests to the file system have to contact the NameNode. Finally the scale up of a namespace is difficult because the NameNode manages all metadata of the namespace on its own memory, which is limited and expensive resource. In this paper, we propose a new HDFS architecture consisting of several NameNodes to resolve all the above problems.
Keywords
Big Data; distributed databases; network operating systems; parallel processing; reliability; software architecture; Big Data; HDFS architecture; Hadoop distributed file system; SPOF; commodity hardware; distributed NameNode cluster; reliability; single point of failure; Distributed databases; Educational institutions; File systems; Hardware; Load management; Silicon; Synchronization; Distributed NameNodes; HDFS; Hadoop; High-Availability;
fLanguage
English
Publisher
ieee
Conference_Titel
Reliable Distributed Systems (SRDS), 2014 IEEE 33rd International Symposium on
Conference_Location
Nara
Type
conf
DOI
10.1109/SRDS.2014.61
Filename
6983409
Link To Document