DocumentCode
2142957
Title
An Algorithm and Hardware Design for Very Fast Similarity Search in High Dimensional Space
Author
Singh, Vishwakarma ; Wenyu Jiang
Author_Institution
Dept. of Comput. Sci., UCSB, Santa Barbara, CA, USA
fYear
2010
fDate
14-16 Aug. 2010
Firstpage
426
Lastpage
431
Abstract
Similarity search in very high dimensions is vital for many scientific research activities as well as real applications. A high performance, scalable, and optimal quality solution to the problem still remains challenging. We propose a vote count based algorithm using p-stable distribution for approximate similarity search. Approximate similarity search effectively serves purpose for many real applications. Our algorithm is efficient and scalable with both dimension and database size. We also propose a novel hardware implementation of the algorithm using simple modification to Random Access Memory (RAM). The hardware design gives real time search for millions of points at practical cost. We empirically achieve high accuracy for query results using our algorithm on 128 dimensional synthetic and real datasets.
Keywords
approximation theory; query processing; random-access storage; statistical distributions; RAM; approximate similarity search; p-stable distribution; query results; random access memory; vote count based algorithm; Accuracy; Algorithm design and analysis; Artificial neural networks; Databases; Hardware; Nearest neighbor searches; Random access memory; Fast; Hardware; High Dimensional Space; Nearest Neighbor Search;
fLanguage
English
Publisher
ieee
Conference_Titel
Granular Computing (GrC), 2010 IEEE International Conference on
Conference_Location
San Jose, CA
Print_ISBN
978-1-4244-7964-1
Type
conf
DOI
10.1109/GrC.2010.114
Filename
5575953
Link To Document