DocumentCode :
2492643
Title :
An Incremental Prefix Filtering Approach for the All Pairs Similarity Search Problem
Author :
Lam, Hoang Thanh ; Dung, Dinh Viet ; Perego, Raffaele ; Silvestri, Fabrizio
Author_Institution :
Univ. of Pisa, Pisa, Italy
fYear :
2010
fDate :
6-8 April 2010
Firstpage :
188
Lastpage :
194
Abstract :
Given a set of records, a threshold value t and a similarity function, we investigate the problem of finding all pairs of records such that similarity between each pair is above t. We propose several optimizations on the existing approaches to solve the problem. Our algorithm outperforms the state-of-the-art algorithms in the case with large and high-dimensional datasets. The speedup we achieved varied from 30% to 4-x depending on the similarity threshold and the dataset properties.
Keywords :
database management systems; optimisation; query formulation; search problems; all pairs similarity search problem; incremental prefix filtering approach; optimizations; Collaboration; Databases; Equations; Information filtering; Information filters; Length measurement; Proposals; Search problems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Conference (APWEB), 2010 12th International Asia-Pacific
Conference_Location :
Busan
Print_ISBN :
978-1-7695-4012-2
Electronic_ISBN :
978-1-4244-6600-9
Type :
conf
DOI :
10.1109/APWeb.2010.30
Filename :
5474136
Link To Document :
بازگشت