Title :
Sound classification based on temporal feature integration
Author :
Ntalampiras, Stavros ; Potamitis, Ilyas ; Fakotakis, Nikos
Author_Institution :
Electr. & Comput. Eng. Dept., Univ. of Patras, Patras, Greece
Abstract :
The present work contributes to the field of generalized sound classification. We extensively examine the performance of the following three feature sets: a) MPEG-7 Audio Spectrum Projection, b) MFCC (using an alternative extraction method) and c) a group derived from critical-band-based wavelet packets. Subsequently, three types of temporal feature integration strategies are applied to the extracted instantaneous values: a) short-term statistics, b) spectral moments and c) two autoregressive functions. During the experimental phase, we organize ten sound classes using high-quality professional sound effects collections. The density of each category is approximated with left-right hidden Markov models. Comparative results are provided for all feature sets and integration methods, and they demonstrate the superiority of the short-term statistics method.
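The short-term statistics strategy named in the abstract can be illustrated with a minimal sketch. The abstract does not state which statistics, window length or hop the authors used, so the choice of per-window mean and variance, the function name `integrate_short_term_stats`, and the `window` and `hop` values are all assumptions made for illustration only.

```python
from statistics import fmean, pvariance


def integrate_short_term_stats(frames, window=4, hop=2):
    """Summarise frame-level features by per-window mean and variance.

    frames: list of equal-length feature vectors, one per analysis frame.
    window, hop: window length and step in frames (assumed values,
    not taken from the paper).
    Returns one vector per window: [means..., variances...].
    """
    if window <= 0 or hop <= 0:
        raise ValueError("window and hop must be positive")
    integrated = []
    for start in range(0, len(frames) - window + 1, hop):
        block = frames[start:start + window]
        columns = list(zip(*block))  # one tuple per feature dimension
        means = [fmean(col) for col in columns]
        variances = [pvariance(col) for col in columns]  # population variance
        integrated.append(means + variances)
    return integrated


# Toy input: six frames of a two-dimensional feature (hypothetical data).
frames = [[1.0, 10.0], [2.0, 10.0], [3.0, 10.0],
          [4.0, 10.0], [5.0, 10.0], [6.0, 10.0]]
result = integrate_short_term_stats(frames, window=4, hop=2)
```

Each output vector doubles the feature dimension, since it carries one mean and one variance per original coefficient. In a pipeline like the one described, such integrated vectors would presumably replace the instantaneous values as the observations passed to the classifier.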
Keywords :
audio signal processing; autoregressive processes; feature extraction; hidden Markov models; waveform analysis; MFCC; MPEG-7 audio spectrum projection; autoregressive functions; left-right hidden Markov models; short-term statistics; sound classification; spectral moments; temporal feature integration; wavelet packets; Acoustic signal processing; Audio databases; Communication system control; Hidden Markov models; MPEG 7 Standard; Mel frequency cepstral coefficient; Process control; Spatial databases; Statistics; Wavelet packets;
Conference_Title :
Communications, Control and Signal Processing (ISCCSP), 2010 4th International Symposium on
Conference_Location :
Limassol
Print_ISBN :
978-1-4244-6285-8
DOI :
10.1109/ISCCSP.2010.5463315