Title :
Music genre classification using multiscale scattering and sparse representations
Author :
Xu Chen ; Ramadge, Peter J.
Author_Institution :
Dept. of Electr. Eng., Princeton Univ., Princeton, NJ, USA
Abstract :
An effective music genre classication approach is proposed that combines the translation-invariance and deformation-robustness property of scattering coefficients and the discriminative power of sparse representation-based classifiers. We argue that these two approaches to feature selection and classification complement each other in reducing the in-class variability of data, and this should lead to enhanced performance. Our results show clear improvement over a variety of previous approaches. A music genre classication accuracy of approximately 91.2% on the GTZAN database is reported.
Keywords :
music; pattern classification; sparse matrices; GTZAN database; deformation-robustness property; feature classification approach; feature selection approach; in-class data variability reduction; multiscale scattering representation; music genre classication approach; performance enhancement; scattering coefficients; sparse representation-based classifiers; translation-invariance property; Accuracy; Dictionaries; Feature extraction; Scattering; Training; Vectors; Wavelet transforms;
Conference_Titel :
Information Sciences and Systems (CISS), 2013 47th Annual Conference on
Conference_Location :
Baltimore, MD
Print_ISBN :
978-1-4673-5237-6
Electronic_ISBN :
978-1-4673-5238-3
DOI :
10.1109/CISS.2013.6552324