DocumentCode
3136448
Title
A Simplified Early Auditory Model with Application in Speech/Music Classification
Author
Chu, Wei ; Champagne, Benoit
Author_Institution
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, Que.
fYear
2006
fDate
38838
Firstpage
775
Lastpage
778
Abstract
The past decade has seen extensive research on audio classification and segmentation algorithms. However, the effect of background noise on the performance of classification has not been investigated widely. Recently, an early auditory model that calculates a so-called auditory spectrum has been employed in audio classification where excellent performance is reported along with robustness in noisy environment. Unfortunately, this early auditory model is characterized by high computational requirements and the use of nonlinear processing. In this paper, by introducing certain modifications we propose a simplified version of this model which is linear except for the calculation of the square-root value of the energy. A speech/music classification task is carried out to evaluate the classification performance wherein a support vector machine (SVM) is used as the classifier. Compared to a conventional FFT-based spectrum, both the original auditory spectrum and the proposed simplified auditory spectrum show more robust performance in noisy test cases. Test results also indicate that, with a reduced computational complexity, the performance of the proposed simplified auditory spectrum is close to that of the original auditory spectrum
Keywords
audio signal processing; music; signal classification; speech processing; support vector machines; FFT-based spectrum; audio classification; audio segmentation; auditory spectrum; music classification; noisy test case; nonlinear processing; simplified auditory model; speech classification; square-root value; support vector machine; Acoustic noise; Background noise; Classification algorithms; Computational complexity; Noise robustness; Speech analysis; Support vector machine classification; Support vector machines; Testing; Working environment noise; Audio classification; auditory spectrum; early auditory model; noise-robustness;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on
Conference_Location
Ottawa, Ont.
Print_ISBN
1-4244-0038-4
Electronic_ISBN
1-4244-0038-4
Type
conf
DOI
10.1109/CCECE.2006.277665
Filename
4054666
Link To Document