• DocumentCode
    2880
  • Title

    Algebraic Optimization for Processing Graph Pattern Queries in the Cloud

  • Author

    Anyanwu, Kemafor ; Kim, HyeongSik ; Ravindra, Padmashree

  • Author_Institution
    North Carolina State University
  • Volume
    17
  • Issue
    2
  • fYear
    2013
  • fDate
    March-April 2013
  • Firstpage
    52
  • Lastpage
    61
  • Abstract
    MapReduce platforms such as Hadoop are now the de facto standard for large-scale data processing, but they have significant limitations for join-intensive workloads typical in Semantic Web processing. This article overviews an algebraic optimization approach based on a Nested TripleGroup Data Model and Algebra (NTGA) that minimizes overall processing costs by reducing the number of MapReduce cycles. It also presents an approach for integrating NTGA-based processing of graph pattern queries into Apache Pig and compares it to execution plans using relational-style algebra operators.
  • Keywords
    Data processing; Optimization; Query processing; Resource description framework; database management; information technology and systems; query languages; query processing;
  • fLanguage
    English
  • Journal_Title
    Internet Computing, IEEE
  • Publisher
    ieee
  • ISSN
    1089-7801
  • Type

    jour

  • DOI
    10.1109/MIC.2012.22
  • Filename
    6138841