DocumentCode :
2676003
Title :
Environmental sound recognition with CELP-based features
Author :
Tsau, Enshuo ; Kim, Seung-Hwan ; Kuo, C. -C Jay
Author_Institution :
Ming Hsieh Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
fYear :
2011
fDate :
June 30 2011-July 1 2011
Firstpage :
1
Lastpage :
4
Abstract :
In this work, we propose the use of a set of new features based on CELP (Code Excited Linear Prediction) to enhance the performance of the environmental sound recognition (ESR) problem. Traditionally, Mel Frequency Cepstral Coefficients (MFCC) have been used for the recognition of structured data like speech and music. However, their performance for the ESR problem is limited. An audio signal can be well preserved by its highly compressed CELP bit streams, which motivates us to study the CELP-based features for the audio scene recognition problem. We present a way to extract a set of features from the CELP bit streams and compare the performance of ESR using different feature sets with the Bayesian network classifier. It is shown by experimental results that the CELP-based features outperform the MFCC features in the ESR problem by a significant 9% margin in average and the integrated MFCC and CELP-based feature set can even reach a correct classification rate of 95.2% using the Bayesian network classifier.
Keywords :
audio signal processing; belief networks; prediction theory; signal classification; Bayesian network classifier; CELP based features; Mel frequency cepstral coefficients; audio scene recognition problem; audio signal; code excited linear prediction; environmental sound recognition; Bayesian methods; Feature extraction; Image analysis; Mel frequency cepstral coefficient; Performance evaluation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Circuits and Systems (ISSCS), 2011 10th International Symposium on
Conference_Location :
lasi
Print_ISBN :
978-1-61284-944-7
Type :
conf
DOI :
10.1109/ISSCS.2011.5978729
Filename :
5978729
Link To Document :
بازگشت