Title :
Understanding the communication characteristics in HBase: What are the fundamental bottlenecks?
Author :
Wasi-ur-Rahman, Md ; Huang, Jian ; Jose, Jithin ; Ouyang, Xiangyong ; Wang, Hao ; Islam, Nusrat S. ; Subramoni, Hari ; Murthy, Chet ; Panda, Dhabaleswar K.
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
HBase is an open-source, distributed, column-oriented Key/Value database. In this paper, we focus on analyzing the performance aspects of HBase. Existing literature on HBase provides high-level descriptions of its operations and presents overall performance results. We conducted comprehensive experiments and identified the different factors contributing to the overall latency of Get and Put operations. Our experimental results reveal that, for in-memory workloads, communication accounts for about 67% and 45% of the time for a 1 KB Get request over 1 Gigabit Ethernet (1 GigE) and 10 Gigabit Ethernet (10 GigE) networks, respectively. Our results show that the HBase communication stack and its associated operations need to be redesigned for high-performance networks such as InfiniBand and their features.
Keywords :
distributed databases; local area networks; public domain software; query processing; Ethernet; Get operation latency; Get request; HBase communication characteristics; HBase communication stack; InfiniBand; Put operation latency; bottleneck; communication time; high-performance network; in-memory workload; open source distributed column-oriented Key-Value database; Benchmark testing; Communication channels; Distributed databases; Network interfaces; Payloads; Servers; Size measurement;
Conference_Title :
Performance Analysis of Systems and Software (ISPASS), 2012 IEEE International Symposium on
Conference_Location :
New Brunswick, NJ
Print_ISBN :
978-1-4673-1143-4
Electronic_ISBN :
978-1-4673-1145-8
DOI :
10.1109/ISPASS.2012.6189217