DocumentCode
2690613
Title
Audio Fingerprinting: Combining Computer Vision & Data Stream Processing
Author
Baluja, Shumeet ; Covell, Michele
Author_Institution
Google Inc., Mountain View, CA
Volume
2
fYear
2007
fDate
15-20 April 2007
Abstract
In this paper, we present Waveprint, a novel system for audio identification. Waveprint uses a combination of computer-vision techniques and large-scale data-stream processing algorithms to create compact fingerprints of audio data that can be efficiently matched. The resulting system has excellent identification capabilities for small snippets of audio that have been degraded in a variety of ways, including competing noise, poor recording quality, and cell-phone playback. We measure the tradeoffs between performance, memory usage, and computation through extensive experimentation. The system is more efficient in terms of memory usage and computation, while being more accurate, when compared with previous state-of-the-art systems.
Keywords
audio signal processing; computer vision; audio fingerprinting; cell-phone playback; compact fingerprints; competing noise; data stream processing; memory usage; waveprint; Acoustic noise; Computer vision; Degradation; Fingerprint recognition; Frequency; Image retrieval; Large-scale systems; Signal processing algorithms; Spectrogram; Streaming media; Acoustic Applications; Acoustic Signal Processing; Music; Pattern Recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2007.366210
Filename
4217383