• DocumentCode
    230751
  • Title
    A distributed polygon retrieval algorithm using MapReduce
  • Author
    Guo, Qiulei ; Palanisamy, Balaji ; Karimi, Hassan A.
  • Author_Institution
    Sch. of Inf. Sci., Univ. of Pittsburgh, Pittsburgh, PA, USA
  • fYear
    2014
  • fDate
    22-25 Oct. 2014
  • Firstpage
    435
  • Lastpage
    436
  • Abstract
    The proliferation of data acquisition devices like 3D laser scanners has led to a burst of large-scale spatial terrain data, which poses many challenges for spatial data analysis and computation. With the advent of several emerging collaborative cloud technologies, a natural and cost-effective approach to managing such large-scale data is to store and share the datasets in a publicly hosted cloud service and to process the data within the cloud itself using modern distributed computing paradigms such as MapReduce. For several key spatial data analysis and computation problems, polygon retrieval is a fundamental operation that is often computed under real-time constraints. However, existing sequential algorithms fail to meet this demand effectively, given that terrain data have in recent years witnessed unprecedented growth in both volume and rate. In this work, we develop a MapReduce-based parallel polygon retrieval algorithm that aims to minimize the I/O and CPU loads of the map and reduce tasks during spatial data processing. The results of preliminary experiments on a Hadoop cluster demonstrate that the proposed techniques are scalable and lead to a reduction of more than 35% in the execution time of the polygon retrieval operation compared with existing distributed algorithms.
  • Keywords
    cloud computing; data acquisition; data analysis; information retrieval; Hadoop cluster; MapReduce-based parallel polygon retrieval algorithm; collaborative cloud technologies; data acquisition devices; distributed computing paradigms; distributed polygon retrieval algorithm; large-scale spatial terrain data; publicly hosted cloud service; spatial data analysis; spatial data processing; Arrays; Earth; Elevators; Random access memory; Three-dimensional displays; Tin;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom), 2014 International Conference on
  • Conference_Location
    Miami, FL
  • Type
    conf
  • Filename
    7014591