Title :
Audio fingerprinting: nearest neighbor search in high dimensional binary spaces
Author :
Miller, Matthew L. ; Rodriguez, Manuel Acevedo ; Cox, Ingemar J.
Author_Institution :
NEC Res. Inst., Princeton, NJ, USA
Abstract :
Audio fingerprinting is an emerging research field in which a song must be recognized by matching an extracted "fingerprinting" to a database of known fingerprints. Audio fingerprinting must solve the two key problems of representation and search. In this paper, we are given an 8192-bit binary representation of each five second interval of a song and therefore focus our attention on the problem of high-dimensional nearest neighbor search. High dimensional nearest neighbor search is known to suffer from the curse of dimensionality, i.e. as the dimension increases, the computational or memory costs increase exponentially. However, recently, there has been significant work of efficient, approximate, search algorithms. We build on this work and describe preliminary results of a probabilistic search algorithm. We describe the data structures and search algorithm used and then present experimental results for a database of 1,000 songs containing 12,217,111 fingerprints.
Keywords :
audio signal processing; data structures; feature extraction; search problems; speech recognition; 8192 bit; audio fingerprinting; dimensionality curse; fingerprint database; fingerprint extraction; high dimensional binary spaces; memory cost; nearest neighbor search; probabilistic search algorithms; Audio databases; Broadcast technology; Computational efficiency; Data structures; Fingerprint recognition; Hamming distance; Monitoring; National electric code; Nearest neighbor searches; Pattern recognition;
Conference_Titel :
Multimedia Signal Processing, 2002 IEEE Workshop on
Print_ISBN :
0-7803-7713-3
DOI :
10.1109/MMSP.2002.1203277