Title :
Music genre classification using modulation spectral features and multiple prototype vectors representation
Author :
Lee, Chang-Hsing ; Chou, Chih-Hsun ; Lien, Cheng-Chang ; Fang, Jen-Cheng
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Chung Hua Univ., Hsinchu, Taiwan
Abstract :
In this paper, we will propose an automatic music genre classification approach based on long-term modulation spectral analysis of spectral (OSC and MPEG-7 NASE) as well as cepstral (MFCC) features. A modulation spectrogram corresponding to the collection of modulation spectra of MFCC/OSC/NASE will be constructed. The modulation spectrum is then decomposed into several logarithmically spaced modulation subbands. For each modulation subband, a new set of modulation spectral features, including modulation spectral contrast (MSC), modulation spectral valley (MSV), modulation spectral energy (MSE), modulation spectral centroid (MSCEN) and modulation spectral flatness (MSF) are then computed from each modulation subband. To cope with the problem that the feature vectors extracted from the music tracks of identical music genre might differ significantly, each music genre is modeled with a number of representative prototype vectors generated by c-means clustering algorithm. An information fusion approach which integrates both feature level fusion method and decision level combination method is then employed to improve the classification accuracy. Experiments conducted on ISMIR 2004 music dataset have shown that our proposed approach can achieve higher classification accuracy than other approaches with the same experimental setup.
Keywords :
cepstral analysis; feature extraction; music; pattern clustering; sensor fusion; signal classification; vectors; ISMIR 2004 music dataset; MFCC-OSC-NASE; automatic music genre classification approach; c-means clustering algorithm; cepstral features; decision level combination method; feature level fusion method; feature vector extraction; information fusion approach; long-term modulation spectral analysis; modulation spectral centroid; modulation spectral contrast; modulation spectral energy; modulation spectral features; modulation spectral flatness; modulation spectral valley; modulation spectrogram; modulation subbands; multiple prototype vectors representation; Feature extraction; Mel frequency cepstral coefficient; Modulation; Multiple signal classification; Music; Support vector machine classification; Vectors; Mel-frequency cepstral coefficients; modulation spectral analysis; music genre classification; normalized audio spectrum envelope; octave-based spectral contrast;
Conference_Titel :
Image and Signal Processing (CISP), 2011 4th International Congress on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9304-3
DOI :
10.1109/CISP.2011.6100759