DocumentCode :
3322538
Title :
Top-k Spatial Joins of Probabilistic Objects
Author :
Ljosa, Vebjorn ; Singh, Ambuj K.
Author_Institution :
Broad Inst., Cambridge Center, MIT & Harvard, Cambridge, MA
fYear :
2008
fDate :
7-12 April 2008
Firstpage :
566
Lastpage :
575
Abstract :
Probabilistic data have recently become popular in applications such as scientific and geospatial databases. For images and other spatial datasets, probabilistic values can capture the uncertainty in extent and class of the objects in the images. Relating one such dataset to another by spatial joins is an important operation for data management systems. We consider probabilistic spatial join (PSJ) queries, which rank the results according to a score that incorporates both the uncertainties associated with the objects and the distances between them. We present algorithms for two kinds of PSJ queries: Threshold PSJ queries, which return all pairs that score above a given threshold, and top-k PSJ queries, which return the k top-scoring pairs. For threshold PSJ queries, we propose a plane sweep algorithm that, because it exploits the special structure of the problem, runs in 0(n (log n + k)) time, where n is the number of points and k is the number of results. We extend the algorithms to 2-D data and to top-k PSJ queries. To further speed up top-k PSJ queries, we develop a scheduling technique that estimates the scores at the level of blocks, then hands the blocks to the plane sweep algorithm. By finding high-scoring pairs early, the scheduling allows a large portion of the datasets to be pruned. Experiments demonstrate speed-ups of two orders of magnitude.
Keywords :
computational complexity; database indexing; probability; query processing; scheduling; statistical databases; visual databases; data management systems; index-based scheduling technique; plane sweep algorithm; probabilistic objects; probabilistic spatial join queries; spatial datasets; threshold PSJ queries; top-k PSJ queries; Airports; Application software; Biomedical imaging; Computer science; Image analysis; Image databases; Microscopy; Satellites; Spatial databases; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-1836-7
Electronic_ISBN :
978-1-4244-1837-4
Type :
conf
DOI :
10.1109/ICDE.2008.4497465
Filename :
4497465
Link To Document :
بازگشت