DocumentCode
3549231
Title
Computer vision for music identification: video demonstration
Author
Ke, Yan ; Hoiem, Derek ; Sukthankar, Rahul
Author_Institution
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
2
fYear
2005
fDate
20-25 June 2005
Abstract
This paper describes a demonstration video for our music identification system. The goal of music identification is to reliably recognize a song from a small sample of noisy audio. This problem is challenging because the recording is often corrupted by noise and because the audio sample will only match a small portion of the target song. Additionally, a practical music identification system should scale (in both accuracy and speed) to databases containing hundreds of thousands of songs. Recently, the music identification problem has attracted considerable attention. However, the task remains unsolved, particularly for noisy real-world queries. We cast music identification into an equivalent sub-image retrieval framework: identify the portion of a spectrogram image from the database that best matches a given query snippet. Our approach treats the spectrogram of each music clip as a 2D image and transforms music identification into a corrupted sub-image retrieval problem.
Keywords
audio databases; audio signal processing; computer vision; image denoising; image retrieval; information retrieval systems; music; video signal processing; 2D image; computer vision; music clip spectrogram; music identification system; noisy audio sample; noisy real-world queries; song databases; spectrogram image identification; subimage retrieval problem; video demonstration; Acoustic noise; Audio recording; Computer vision; Image retrieval; Multiple signal classification; Music information retrieval; Signal processing; Signal processing algorithms; Spectrogram; Speech analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on
ISSN
1063-6919
Print_ISBN
0-7695-2372-2
Type
conf
DOI
10.1109/CVPR.2005.106
Filename
1467583
Link To Document