• DocumentCode
    2142957
  • Title

    An Algorithm and Hardware Design for Very Fast Similarity Search in High Dimensional Space

  • Author

    Singh, Vishwakarma ; Wenyu Jiang

  • Author_Institution
    Dept. of Comput. Sci., UCSB, Santa Barbara, CA, USA
  • fYear
    2010
  • fDate
    14-16 Aug. 2010
  • Firstpage
    426
  • Lastpage
    431
  • Abstract
    Similarity search in very high dimensions is vital for many scientific research activities as well as real applications. A high performance, scalable, and optimal quality solution to the problem still remains challenging. We propose a vote count based algorithm using p-stable distribution for approximate similarity search. Approximate similarity search effectively serves purpose for many real applications. Our algorithm is efficient and scalable with both dimension and database size. We also propose a novel hardware implementation of the algorithm using simple modification to Random Access Memory (RAM). The hardware design gives real time search for millions of points at practical cost. We empirically achieve high accuracy for query results using our algorithm on 128 dimensional synthetic and real datasets.
  • Keywords
    approximation theory; query processing; random-access storage; statistical distributions; RAM; approximate similarity search; p-stable distribution; query results; random access memory; vote count based algorithm; Accuracy; Algorithm design and analysis; Artificial neural networks; Databases; Hardware; Nearest neighbor searches; Random access memory; Fast; Hardware; High Dimensional Space; Nearest Neighbor Search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing (GrC), 2010 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    978-1-4244-7964-1
  • Type

    conf

  • DOI
    10.1109/GrC.2010.114
  • Filename
    5575953