DocumentCode :
2240585
Title :
Extracting vocal melody from karaoke music audio
Author :
Zhu, Yongwei ; Gao, Sheng
Author_Institution :
Inst. for Infocomm Res., A-STAR, Singapore, Singapore
fYear :
2005
fDate :
6-8 July 2005
Abstract :
Extracting the melody from polyphonic musical audio is a nontrivial research problem. This paper presents an approach for vocal melody extraction from dual channel Karaoke music audio. The extracted melody corresponds to the singing voice in the original performance channel, which can then be used for melody-based music retrieval. In the proposed technique, audio signals from both the accompaniment channel and the original performance channel are analyzed. The note partials are firstly extracted from the signal, which is represented in constant-Q transform frequency domain. Then the volume balance between the two channels is estimated based on signal approximation in the sub-bands. Finally, the pitch corresponding to the singing voice is identified based on the note partial differences between the two channels. The extracted melody is represented as a sequence of pitch values. This technique assumes that the two channels have similar accompaniment instrument performance except for the singing voices. Experimental result on 40 Karaoke music audios has shown the performance of the proposed technique. The pitch extraction rate is above 70% and melody retrieval accuracy in an 800-tune-database is 90%.
Keywords :
audio signal processing; electronic music; feature extraction; frequency-domain analysis; information retrieval; music; transforms; audio signal; constant-Q transform; dual channel Karaoke system; frequency domain analysis; melody-based music retrieval; pitch extraction rate; polyphonic music audio; subband signal approximation; vocal melody extraction; DVD; Data mining; Frequency domain analysis; Indexing; Instruments; Multiple signal classification; Music information retrieval; Performance analysis; Signal analysis; Signal processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Print_ISBN :
0-7803-9331-7
Type :
conf
DOI :
10.1109/ICME.2005.1521620
Filename :
1521620
Link To Document :
بازگشت