Title :
Hierarchical audio classification using cepstral modulation ratio regressions based on Legendre polynomials
Author :
Nagathil, Anil ; Göttel, Peter ; Martin, Rainer
Author_Institution :
Inst. of Commun. Acoust., Ruhr-Univ. Bochum, Bochum, Germany
Abstract :
In this work we present a scalable feature set which is obtained by fitting orthogonal polynomials to the normalized modulation spectrum of cepstral coefficients and which can be easily adapted to different classification tasks. The performance of the feature set is investigated in a hierarchically structured audio signal classification experiment and compared with other approaches reported in the literature. For the root categories speech, music and noise a classification accuracy of 95% is achieved. Subclasses such as male and female speech or different noise types are classified with an accuracy of 95% and 85%, respectively. In a 10-category musical genre discrimination experiment the proposed features exhibit an accuracy of 61%.
Keywords :
Legendre polynomials; audio signal processing; cepstral analysis; feature extraction; regression analysis; signal classification; Legendre polynomial; cepstral modulation ratio regression; hierarchical audio classification; hierarchically structured audio signal classification; musical genre discrimination experiment; normalized modulation spectrum; Accuracy; Approximation methods; Cepstral analysis; Modulation; Noise; Polynomials; Speech; cepstral analysis; pattern recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5946921