DocumentCode
70577
Title
Scaling Semantic Graph Databases in Size and Performance
Author
Morari, Alessandro ; Castellana, Vito Giovanni ; Villa, Oreste ; Tumeo, Antonino ; Weaver, James ; Haglin, David ; Choudhury, Sankhayan ; Feo, John
Author_Institution
Pacific Northwest Nat. Lab., Richland, WA, USA
Volume
34
Issue
4
fYear
2014
fDate
July-Aug. 2014
Firstpage
16
Lastpage
26
Abstract
GEMS is a full software system that implements a large-scale, semantic graph database on commodity clusters. Its framework comprises a SPARQL-to-C++ compiler, a library of distributed data structures, and a custom multithreaded runtime library. The authors evaluated their software stack on the Berlin SPARQL benchmark with datasets of up to 10 billion graph edges, demonstrating scaling in dataset size and performance as they added cluster nodes.
Keywords
data structures; multi-threading; program compilers; software libraries; very large databases; workstation clusters; Berlin SPARQL benchmark; GEMS; SPARQL-to-C++ compiler; commodity clusters; distributed data structure library; large-scale semantic graph database; multithreaded runtime library; performance scaling; size scaling; software stack; Big data; Cluster approximation; Data structures; Distributed processing; Multithreading; Resource description framework; Runtime; Semantics; SPARQL; big data; cluster; data aggregation; data analysis; distributed systems; graph databases; multithreading;
fLanguage
English
Journal_Title
Micro, IEEE
Publisher
ieee
ISSN
0272-1732
Type
jour
DOI
10.1109/MM.2014.39
Filename
6785922
Link To Document