Title :
Understanding query performance in Accumulo
Author :
Sawyer, Scott M. ; O´Gwynn, B. David ; An Tran ; yu, tao
Abstract :
Open-source, BigTable-like distributed databases provide a scalable storage solution for data-intensive applications. The simple key-value storage schema provides fast record ingest and retrieval, nearly independent of the quantity of data stored. However, real applications must support non-trivial queries that require careful key design and value indexing. We study an Apache Accumulo-based big data system designed for a network situational awareness application. The application´s storage schema and data retrieval requirements are analyzed. We then characterize the corresponding Accumulo performance bottlenecks. Queries are shown to be communication-bound and server-bound in different situations. Inefficiencies in the open-source communication stack and filesystem limit network and I/O performance, respectively. Additionally, in some situations, parallel clients can contend for server-side resources. Maximizing data retrieval rates for practical queries requires effective key design, indexing, and client parallelization.
Keywords :
client-server systems; database indexing; parallel databases; public domain software; query processing; Accumulo performance bottlenecks; Apache Accumulo-based big-data system; I/O performance; application storage schema; client parallelization; communication-bounded queries; data retrieval rate maximization; data retrieval requirements; data-intensive applications; filesystem limit network; key design; key-value storage schema; network situational awareness application; nontrivial queries; open-source BigTable-like distributed databases; open-source communication stack; query performance; scalable storage solution; server-bounded queries; server-side resources; value indexing; Data handling; Data storage systems; Histograms; Indexes; Information management; Servers;
Conference_Titel :
High Performance Extreme Computing Conference (HPEC), 2013 IEEE
Conference_Location :
Waltham, MA
Print_ISBN :
978-1-4799-1364-0
DOI :
10.1109/HPEC.2013.6670330