• DocumentCode
    610371
  • Title

    Interval indexing and querying on key-value cloud stores

  • Author

    Sfakianakis, Georgios ; Patlakas, I. ; Ntarmos, N. ; Triantafillou, P.

  • Author_Institution
    Comput. Eng. & Inf. Dept., Univ. of Patras, Rio, Greece
  • fYear
    2013
  • fDate
    8-12 April 2013
  • Firstpage
    805
  • Lastpage
    816
  • Abstract
    Cloud key-value stores are becoming increasingly more important. Challenging applications, requiring efficient and scalable access to massive data, arise every day. We focus on supporting interval queries (which are prevalent in several data intensive applications, such as temporal querying for temporal analytics), an efficient solution for which is lacking. We contribute a compound interval index structure, comprised of two tiers: (i) the MRSegmentTree (MRST), a key-value representation of the Segment Tree, and (ii) the Endpoints Index (EPI), a column family index that stores information for interval endpoints. In addition to the above, our contributions include: (i) algorithms for efficiently constructing and populating our indices using MapReduce jobs, (ii) techniques for efficient and scalable index maintenance, and (iii) algorithms for processing interval queries. We have implemented all algorithms using HBase and Hadoop, and conducted a detailed performance evaluation. We quantify the costs associated with the construction of the indices, and evaluate our query processing algorithms using queries on real data sets. We compare the performance of our approach to two alternatives: the native support for interval queries provided in HBase, and the execution of such queries using the Hive query execution tool. Our results show a significant speedup, far outperforming the state of the art.
  • Keywords
    cloud computing; indexing; query processing; tree data structures; EPI; HBase; Hadoop; Hive query execution tool; MRST; MRSegmentTree; MapReduce job; cloud key-value store; column family index; data access; data intensive application; endpoints index; interval index structure; interval querying; key-value representation; query processing algorithm; real data set; scalable index maintenance; temporal analytics; temporal querying; Buildings; Crawlers; Indexing; Query processing; Vegetation; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2013 IEEE 29th International Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4673-4909-3
  • Electronic_ISBN
    1063-6382
  • Type

    conf

  • DOI
    10.1109/ICDE.2013.6544876
  • Filename
    6544876