• DocumentCode
    2495844
  • Title

    WARP: Workload-aware replication and partitioning for RDF

  • Author

    Hose, K. ; Schenkel, Ralf

  • Author_Institution
    Dept. of Comput. Sci., Aalborg Univ., Aalborg, Denmark
  • fYear
    2013
  • fDate
    8-12 April 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    With the increasing popularity of the Semantic Web, more and more data becomes available in RDF with SPARQL as a query language. Data sets, however, can become too big to be managed and queried on a single server in a scalable way. Existing distributed RDF stores approach this problem using data partitioning, aiming at limiting the communication between servers and exploiting parallelism. This paper proposes a distributed SPARQL engine that combines a graph partitioning technique with workload-aware replication of triples across partitions, enabling efficient query execution even for complex queries from the workload. Furthermore, it discusses query optimization techniques for producing efficient execution plans for ad-hoc queries not contained in the workload.
  • Keywords
    SQL; graph theory; pattern classification; query processing; replicated databases; semantic Web; ad hoc query; data partitioning; data server; data set management; distributed RDF; distributed SPARQL engine; graph partitioning technique; query execution; query language; query optimization technique; semantic Web; workload aware replication and partitioning; Distributed databases; Optimization; Parallel processing; Query processing; Resource description framework; Servers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering Workshops (ICDEW), 2013 IEEE 29th International Conference on
  • Conference_Location
    Brisbane, QLD
  • Print_ISBN
    978-1-4673-5303-8
  • Electronic_ISBN
    978-1-4673-5302-1
  • Type

    conf

  • DOI
    10.1109/ICDEW.2013.6547414
  • Filename
    6547414