DocumentCode
614636
Title
Music genre classification using multiscale scattering and sparse representations
Author
Xu Chen ; Ramadge, Peter J.
Author_Institution
Dept. of Electr. Eng., Princeton Univ., Princeton, NJ, USA
fYear
2013
fDate
20-22 March 2013
Firstpage
1
Lastpage
6
Abstract
An effective music genre classication approach is proposed that combines the translation-invariance and deformation-robustness property of scattering coefficients and the discriminative power of sparse representation-based classifiers. We argue that these two approaches to feature selection and classification complement each other in reducing the in-class variability of data, and this should lead to enhanced performance. Our results show clear improvement over a variety of previous approaches. A music genre classication accuracy of approximately 91.2% on the GTZAN database is reported.
Keywords
music; pattern classification; sparse matrices; GTZAN database; deformation-robustness property; feature classification approach; feature selection approach; in-class data variability reduction; multiscale scattering representation; music genre classication approach; performance enhancement; scattering coefficients; sparse representation-based classifiers; translation-invariance property; Accuracy; Dictionaries; Feature extraction; Scattering; Training; Vectors; Wavelet transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Sciences and Systems (CISS), 2013 47th Annual Conference on
Conference_Location
Baltimore, MD
Print_ISBN
978-1-4673-5237-6
Electronic_ISBN
978-1-4673-5238-3
Type
conf
DOI
10.1109/CISS.2013.6552324
Filename
6552324
Link To Document