DocumentCode :
3166143
Title :
On using the auditory image model and invariant-integration for noise robust automatic speech recognition
Author :
Müller, Florian ; Mertins, Alfred
Author_Institution :
Inst. for Signal Process., Univ. of Lubeck, Lubeck, Germany
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4905
Lastpage :
4908
Abstract :
Commonly used feature extraction methods for automatic speech recognition (ASR) incorporate only rudimentary psychoacoustic findings. Several works showed that a physiologically closer auditory processing during the feature extraction stage can enhance the robustness of an ASR system in noisy environments. The “auditory image model” (AIM) is such a more sophisticated computational model. In this work we show how invariant integration can be applied in the feature space given by the AIM, and we analyze the performance of the resulting features under noisy conditions on the Aurora-2 task. Furthermore, we show that previously presented features based on power-normalization and invariant integration benefit from the AIM-based integration features when the feature vectors are combined with each other.
Keywords :
feature extraction; integration; speech recognition; AIM; ASR system; Aurora-2 task; auditory image model; feature extraction stage; invariant integration; invariant-integration; noise robust automatic speech recognition; physiological closer auditory processing; power-normalization; sophisticated computational model; Accuracy; Feature extraction; Robustness; Speech; Speech processing; Training; Transforms; Robust speech recognition; auditory processing; feature extraction; invariant integration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289019
Filename :
6289019
Link To Document :
بازگشت