DocumentCode :
2694087
Title :
Using dtw based unsupervised segmentation to improve the vocal part detection in pop music
Author :
Xiao, Linxing ; Zhou, Jie ; Zhang, Tong
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing
fYear :
2008
fDate :
June 23 2008-April 26 2008
Firstpage :
1193
Lastpage :
1196
Abstract :
Vocal part detection, which plays an important role in music information retrieval, is still a tough task so far. Previous works focused on short time features, which cannot capture some essential long term characteristics of singing. In this paper, we propose a Dynamic Time Warping based unsupervised segmentation algorithm to divide a pop song into homogeneous segments, which contain either vocal or pure music sound. This procedure makes it possible to design long term feature or classification schema to improve the accuracy of vocal part detection. We also present a segment level classification schema based on the result of segmentation. It will be shown that the classification accuracy is significantly improved.
Keywords :
audio signal processing; information retrieval; music; signal classification; signal detection; unsupervised learning; DTW; dynamic time warping; music information retrieval; pop music; signal classification; unsupervised segmentation algorithm; vocal part detection; Acoustic signal detection; Cepstrum; Dynamic programming; Hidden Markov models; Length measurement; Linear predictive coding; Mel frequency cepstral coefficient; Multiple signal classification; Music information retrieval; Time measurement; Acoustic signal detection; Dynamic programming; Pattern classification; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2008 IEEE International Conference on
Conference_Location :
Hannover
Print_ISBN :
978-1-4244-2570-9
Electronic_ISBN :
978-1-4244-2571-6
Type :
conf
DOI :
10.1109/ICME.2008.4607654
Filename :
4607654
Link To Document :
بازگشت