Title :
Audio scene analysis using parametric signal features
Author :
Asefi, Hamid ; Ghoraani, Behnaz ; Ye, Andy ; Krishnan, Sridhar
Author_Institution :
Dept. of Electr. & Comput. Eng., Ryerson Univ., Toronto, ON, Canada
Abstract :
The objective of this paper is to propose pole modeling-based features for audio classification in order to achieve a high classification performance. This paper investigates the suit- able pole modeling computation method and evaluates the proposed audio features in an audio database with 40 human speech samples, and 40 non human audio signals including aircraft, helicopter, drum, flutes, and piano sounds. An accuracy rate of 85% is acheived using the pole modeling features and linear discriminant analysis (LDA). We also compare the performance of the pole modeling features with two well-known audio features: Autoregressive (AR), and Mel-frequency Cepstral coefficients (MFCCs). We found that pole modeling is an appropriate tool for real-time audio scene analysis.
Keywords :
audio signal processing; feature extraction; aircraft sounds; audio classification; autoregressive; drum sounds; flutes sounds; helicopter sounds; linear discriminant analysis; mel-frequency cepstral coefficients; non human audio signals; parametric signal features; piano sounds; pole modeling computation; pole modeling-based features; real-time audio scene analysis; speech samples; Accuracy; Eigenvalues and eigenfunctions; Feature extraction; Humans; Matrix decomposition; Mel frequency cepstral coefficient; Polynomials; AR modeling; Audio Classification; Feature Extraction; MFCC; Pole modeling;
Conference_Titel :
Electrical and Computer Engineering (CCECE), 2011 24th Canadian Conference on
Conference_Location :
Niagara Falls, ON
Print_ISBN :
978-1-4244-9788-1
Electronic_ISBN :
0840-7789
DOI :
10.1109/CCECE.2011.6030593