• DocumentCode
    3429007
  • Title

    High performance RDMA-based design of HDFS over InfiniBand

  • Author

    Islam, Nusrat Sharmin ; Rahman, Mohammad Wahidur ; Jose, Jithin ; Rajachandrasekar, Raghunath ; Wang, Huifang ; Subramoni, Hari ; Murthy, Cherukuri ; Panda, Dhabaleswar K.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    The Hadoop Distributed File System (HDFS) acts as the primary storage of Hadoop and has been adopted by reputed organizations (Facebook, Yahoo!, etc.) due to its portability and fault tolerance. The existing implementation of HDFS uses the Java socket interface for communication, which delivers suboptimal performance in terms of latency and throughput. For data-intensive applications, network performance becomes a key component as the amount of data being stored and replicated to HDFS increases. In this paper, we present a novel design of HDFS using Remote Direct Memory Access (RDMA) over InfiniBand via JNI interfaces. Experimental results show that, for 5 GB HDFS file writes, the new design reduces the communication time by 87% and 30% over 1 Gigabit Ethernet (1GigE) and IP-over-InfiniBand (IPoIB), respectively, on a QDR platform (32 Gbps). For HBase, the Put operation performance is improved by 26% with our design. To the best of our knowledge, this is the first design of HDFS over InfiniBand networks.
  • Keywords
    IP networks; computer network performance evaluation; distributed databases; fault tolerant computing; local area networks; public domain software; telecommunication switching; 1 Gigabit Ethernet; Facebook; HBase; HDFS; Hadoop distributed file system; IP-over-InfiniBand; IPoIB; JNI interfaces; Java socket interface; Put operation performance improvement; QDR platform; Yahoo!; bit rate 32 Gbit/s; communication reduction; data intensive applications; fault-tolerance; high performance RDMA-based design; latency; network performance; remote direct memory access; throughput; Java; Libraries; Servers; Sockets; Software; Switches
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    2167-4329
  • Print_ISBN
    978-1-4673-0805-2
  • Type

    conf

  • DOI
    10.1109/SC.2012.65
  • Filename
    6468497