DocumentCode :
2180075
Title :
Amplitude modulation spectrogram based features for robust speech recognition in noisy and reverberant environments
Author :
Moritz, Niko ; Anemüller, Jörn ; Kollmeier, Birger
Author_Institution :
Project Group Hearing, Speech & Audio Technol., Fraunhofer IDMT, Oldenburg, Germany
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5492
Lastpage :
5495
Abstract :
In this contribution we present a feature extraction method that relies on the modulation-spectral analysis of amplitude fluctuations within sub-bands of the acoustic spectrum by a STFT. The experimental results indicate that the optimal temporal filter extension for amplitude modulation analysis is around 310 ms. It is also demonstrated that the phase information of the modulation spectrum contains important cues for speech recognition. In this context, the advantage of an odd analysis basis function is considered. The best presented features reached a total relative improvement of 53.5% for clean-condition training on Aurora-2. Furthermore, it is shown that modulation features are more robust against room reverberation than conventional cepstral and dynamic features and that they strongly benefit from a high early-to-late energy ratio of the characteristic RIR.
Keywords :
feature extraction; speech recognition; STFT; acoustic spectrum; amplitude modulation spectrogram; clean-condition training; feature extraction method; noisy environment; reverberant environments; robust speech recognition; Amplitude modulation; Frequency modulation; Reverberation; Robustness; Speech; Speech recognition; Amplitude Modulation Spectrogram (AMS); Automatic Speech Recognition (ASR); Feature Extraction; Phase; Reverberation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947602
Filename :
5947602
Link To Document :
بازگشت