Title :
Cepstral modulation ratio regression (CMRARE) parameters for audio signal analysis and classification
Author :
Martin, Rainer ; Nagathil, Anil
Author_Institution :
Inst. of Commun. Acoust., Ruhr-Univ. Bochum, Bochum
Abstract :
In this paper we propose a new set of parameters for audio signal analysis and classification. These parameters are regressions computed on the normalized modulation spectrum of high-resolution cepstral coefficients. The parameter set is scalable in its size and gives a compact representation of the modulation content of speech and other audio signals. These parameters as well as the regression approximation error are well suited for characterizing audio signals in a unified framework. In particular we use a set of eight parameters in a speech/music/noise classification task in which we achieve a classification accuracy which compares very well with other approaches including static and dynamic MFCCs.
Keywords :
approximation theory; audio signal processing; cepstral analysis; feature extraction; modulation; regression analysis; signal classification; signal representation; signal resolution; speech processing; CMRARE parameter; audio signal classification; cepstral modulation ratio regression; feature extraction; regression approximation error; signal resolution; speech signal representation; Acoustics; Automatic speech recognition; Cepstral analysis; Cepstrum; Discrete Fourier transforms; Mel frequency cepstral coefficient; Polynomials; Signal analysis; Speech enhancement; Working environment noise; cepstrum; modulation spectrum; signal classification;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4959585