• DocumentCode
    187074
  • Title
    A Distributed NameNode Cluster for a Highly-Available Hadoop Distributed File System
  • Author
    Kim, Yonghwan ; Araragi, Tadashi ; Nakamura, Jun ; Masuzawa, Toru
  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
  • fYear
    2014
  • fDate
    6-9 Oct. 2014
  • Firstpage
    333
  • Lastpage
    334
  • Abstract
    Hadoop has recently attracted much attention from engineers and researchers as an emerging and effective framework for Big Data. HDFS (Hadoop Distributed File System) can manage huge amounts of data with high performance and reliability using only commodity hardware. However, HDFS requires a single master node, called the NameNode, to manage the entire namespace of the file system. This creates a SPOF (Single Point Of Failure) because the file system becomes inaccessible when the NameNode fails. It also creates a performance bottleneck, since every access request to the file system has to contact the NameNode. Finally, scaling up the namespace is difficult because the NameNode keeps all metadata of the namespace in its own memory, which is a limited and expensive resource. In this paper, we propose a new HDFS architecture consisting of several NameNodes to resolve all of the above problems.
  • Keywords
    Big Data; distributed databases; network operating systems; parallel processing; reliability; software architecture; Big Data; HDFS architecture; Hadoop distributed file system; SPOF; commodity hardware; distributed NameNode cluster; reliability; single point of failure; Distributed databases; Educational institutions; File systems; Hardware; Load management; Silicon; Synchronization; Distributed NameNodes; HDFS; Hadoop; High-Availability;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Reliable Distributed Systems (SRDS), 2014 IEEE 33rd International Symposium on
  • Conference_Location
    Nara
  • Type
    conf
  • DOI
    10.1109/SRDS.2014.61
  • Filename
    6983409