DocumentCode :
3549231
Title :
Computer vision for music identification: video demonstration
Author :
Ke, Yan ; Hoiem, Derek ; Sukthankar, Rahul
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
2
fYear :
2005
fDate :
20-25 June 2005
Abstract :
This paper describes a demonstration video for our music identification system. The goal of music identification is to reliably recognize a song from a small sample of noisy audio. This problem is challenging because the recording is often corrupted by noise and because the audio sample will only match a small portion of the target song. Additionally, a practical music identification system should scale (in both accuracy and speed) to databases containing hundreds of thousands of songs. Recently, the music identification problem has attracted considerable attention. However, the task remains unsolved, particularly for noisy real-world queries. We cast music identification into an equivalent sub-image retrieval framework: identify the portion of a spectrogram image from the database that best matches a given query snippet. Our approach treats the spectrogram of each music clip as a 2D image and transforms music identification into a corrupted sub-image retrieval problem.
Keywords :
audio databases; audio signal processing; computer vision; image denoising; image retrieval; information retrieval systems; music; video signal processing; 2D image; computer vision; music clip spectrogram; music identification system; noisy audio sample; noisy real-world queries; song databases; spectrogram image identification; subimage retrieval problem; video demonstration; Acoustic noise; Audio recording; Computer vision; Image retrieval; Multiple signal classification; Music information retrieval; Signal processing; Signal processing algorithms; Spectrogram; Speech analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on
ISSN :
1063-6919
Print_ISBN :
0-7695-2372-2
Type :
conf
DOI :
10.1109/CVPR.2005.106
Filename :
1467583
Link To Document :
بازگشت