• DocumentCode
    2690613
  • Title

    Audio Fingerprinting: Combining Computer Vision & Data Stream Processing

  • Author

    Baluja, Shumeet ; Covell, Michele

  • Author_Institution
    Google Inc., Mountain View, CA
  • Volume
    2
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    In this paper, we present Waveprint, a novel system for audio identification. Waveprint uses a combination of computer-vision techniques and large-scale data-stream processing algorithms to create compact fingerprints of audio data that can be efficiently matched. The resulting system has excellent identification capabilities for small snippets of audio that have been degraded in a variety of ways, including competing noise, poor recording quality, and cell-phone playback. We measure the tradeoffs between performance, memory usage, and computation through extensive experimentation. The system is more efficient in terms of memory usage and computation, while being more accurate, when compared with previous state-of-the-art systems.
  • Keywords
    audio signal processing; computer vision; audio fingerprinting; cell-phone playback; compact fingerprints; competing noise; data stream processing; memory usage; waveprint; Acoustic noise; Computer vision; Degradation; Fingerprint recognition; Frequency; Image retrieval; Large-scale systems; Signal processing algorithms; Spectrogram; Streaming media; Acoustic Applications; Acoustic Signal Processing; Music; Pattern Recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366210
  • Filename
    4217383