DocumentCode :
784530
Title :
Automatic Music Genre Classification Based on Modulation Spectral Analysis of Spectral and Cepstral Features
Author :
Lee, Chang-Hsing ; Shih, Jau-Ling ; Yu, Kun-Ming ; Lin, Hwai-San
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Chung HuaUniversity, Hsinchu
Volume :
11
Issue :
4
fYear :
2009
fDate :
6/1/2009 12:00:00 AM
Firstpage :
670
Lastpage :
682
Abstract :
In this paper, we will propose an automatic music genre classification approach based on long-term modulation spectral analysis of spectral (OSC and MPEG-7 NASE) as well as cepstral (MFCC) features. Modulation spectral analysis of every feature value will generate a corresponding modulation spectrum and all the modulation spectra can be collected to form a modulation spectrogram which exhibits the time-varying or rhythmic information of music signals. Each modulation spectrum is then decomposed into several logarithmically-spaced modulation subbands. The modulation spectral contrast (MSC) and modulation spectral valley (MSV) are then computed from each modulation subband. Effective and compact features are generated from statistical aggregations of the MSCs and MSVs of all modulation subbands. An information fusion approach which integrates both feature level fusion method and decision level combination method is employed to improve the classification accuracy. Experiments conducted on two different music datasets have shown that our proposed approach can achieve higher classification accuracy than other approaches with the same experimental setup.
Keywords :
cepstral analysis; modulation; music; signal classification; time-varying systems; automatic music genre classification; cepstral feature; decision level combination method; feature level fusion method; modulation spectral analysis; statistical aggregation; time-varying signal; Mel-frequency cepstral coefficients; modulation spectral analysis; music genre classification; normalized audio spectrum envelope; octave-based spectral contrast;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2009.2017635
Filename :
4895319
Link To Document :
بازگشت