• DocumentCode
    588196
  • Title

    Partial replica selection for spatial datasets

  • Author

    Yun Tian ; Rhodes, Philip J.

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Univ. of Mississippi, MS, USA
  • fYear
    2012
  • fDate
    8-12 Oct. 2012
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    The implementation of partial or incomplete replicas, which represent only a subset of a larger dataset, has been an active topic of research. Partial Spatial Replicas extend this functionality to spatial data, allowing us to distribute a spatial dataset in pieces over several locations. Accessing only a subset of a spatial replica usually results in a large number of relatively small read requests made to the underlying storage device. For this reason, an accurate model of disk access is important when working with spatial subsets. We make two primary contributions in this paper. First, we describe a model for disk access performance that takes filesystem prefetching into account and is sufficiently accurate for spatial replica selection. Second, making a few simplifying assumptions, we propose a fast replica selection algorithm for partial spatial replicas. The algorithm uses a greedy approach that attempts to maximize performance by choosing a collection of replica subsets that allow fast data retrieval by a client machine. Experiments show that the performance of the solution found by our algorithm is on average always at least 91% and 93.4% of the performance of the optimal solution in 4-node and 8-node tests respectively.
  • Keywords
    greedy algorithms; performance evaluation; query processing; replicated databases; storage management; visual databases; 4-node tests; 8-node tests; client machine; data retrieval; disk access performance model; fast replica selection algorithm; filesystem; greedy approach; incomplete replicas; partial replica selection; partial spatial replicas; performance maximization; prefetching; read requests; spatial data functionality; spatial datasets; storage device; Bandwidth; Clustering algorithms; Computational modeling; Data models; Prefetching; Servers; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    E-Science (e-Science), 2012 IEEE 8th International Conference on
  • Conference_Location
    Chicago, IL
  • Print_ISBN
    978-1-4673-4467-8
  • Type

    conf

  • DOI
    10.1109/eScience.2012.6404473
  • Filename
    6404473