DocumentCode :
1879451
Title :
Automatic music genre classification using timbral texture and rhythmic content features
Author :
Baniya, Babu Kaji ; Ghimire, Deepak ; Joonwhoan Lee
Author_Institution :
Div. of Comput. Sci. & Eng., Chonbuk Nat. Univ., Jeonju, South Korea
fYear :
2015
fDate :
1-3 July 2015
Firstpage :
434
Lastpage :
443
Abstract :
Music genre classification is a vital component for the music information retrieval system. There are two important components to be considered for better genre classification, which are audio feature extraction and classifier. This paper incorporates two different kinds of features for genre classification, timbral texture and rhythmic content features. Timbral texture contains the Mel-frequency Cepstral Coefficient (MFCC) with other several spectral features. Before choosing a timbral feature we explore which feature contributes a less significant role on genre discrimination. This facilitates the reduction of feature dimension. For the timbral features up to the 4-th order central moments and the covariance components of mutual features are considered to improve the overall classification result. For the rhythmic content the features extracted from beat histogram are selected. In the paper Extreme Learning Machine (ELM) with bagging is used as the classifier for classifying the genres. Based on the proposed feature sets and classifier, experiments are performed with two well-known datasets: GTZAN and the ISMIR2004 databases with ten and six different music genres, respectively. The proposed method acquires better and competitive classification accuracy compared to the existing approaches for both data sets.
Keywords :
audio signal processing; feature extraction; information retrieval systems; learning (artificial intelligence); music; signal classification; ELM with bagging; GTZAN database; ISMIR2004 database; MFCC; Mel-frequency cepstral coefficient; audio feature classifier; audio feature extraction; classification accuracy; extreme learning machine; feature dimension reduction; genre discrimination; music genre classification; music information retrieval system; rhythmic content feature; timbral texture; Bagging; Feature extraction; Histograms; Mel frequency cepstral coefficient; Speech; Standards; Classification; ELM (Extreme Learning Machine) with bagging; covariance matrix; music genres; rhythmic contents; timbral texture;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Communication Technology (ICACT), 2015 17th International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-8-9968-6504-9
Type :
conf
DOI :
10.1109/ICACT.2015.7224907
Filename :
7224907
Link To Document :
بازگشت