Title :
Classification of audio scenes using Narrow-Band Autocorrelation features
Author :
Valero, Xavier ; Alías, Francesc
Author_Institution :
Grup de Recerca en Tecnologies Media, La Salle - Univ. Ramon Llull, Barcelona, Spain
Abstract :
Multiple single sound events of very different characteristics might coincide in a given space and time, thus composing complex audio scenes. In that context, defining signal features capable of effectively analyzing the holistic audio scenes is a challenging task. This paper introduces a set of features that consider the temporal, spectral and perceptual characteristics of the audio scene signals. Specifically, the features are obtained from the autocorrelation function of band-pass signals computed after applying a Mel filter bank. The so-called Narrow-Band Autocorrelation (NB-ACF) features are compared to state-of-the-art signal features on a corpus of 4 hours composed of 15 audio scenes. Regardless of the learning algorithm employed, the NB-ACF attains the highest averaged recognition rates: 2.3 % higher than Mel Frequency Cepstral Coefficients and 5.6 % higher than Discrete Wavelet Coefficients.
Keywords :
audio signal processing; discrete wavelet transforms; feature extraction; learning (artificial intelligence); signal classification; NB-ACF features; audio scene classification; audio scene signals; band-pass signal autocorrelation function; complex audio scenes; discrete wavelet coefficients; learning algorithm; mel frequency cepstral coefficients; narrow-band autocorrelation features; perceptual characteristics; signal features; single sound events; spectral characteristics; temporal characteristics; Context; Correlation; Filter banks; Humans; Mel frequency cepstral coefficient; Support vector machines; Audio classification; autocorrelation function; environmental sound recognition; feature extraction; narrow-band signal analysis;
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location :
Bucharest
Print_ISBN :
978-1-4673-1068-0