DocumentCode
3594341
Title
Perceptual speech modeling for noisy speech recognition
Author
Wu, Chung-Hsien ; Chiu, Yu-Hsien ; Lim, Huigan
Author_Institution
Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan, ROC
Volume
1
fYear
2002
Abstract
This paper proposes a perceptual modeling approach with a two-stage recognition to deal with the issues of recognition degradation in noisy environment. The auditory masking effect is used for speech enhancement and acoustic modeling in order to overcome the model inconsistencies between training speech and noisy input. In the two-stage recognition, the maximum a posteriori (MAP) based adaptation algorithm is used to incrementally adapt the noise model. In order to evaluate our proposed approach, a Mandarin keyword spotting system was constructed. The experimental results show our proposed method achieves a better recognition rate compared to the audible noise suppression (ANS) and parallel model combination (PMC) methods for both in 70km/hr (10.3dB) and 90km/hr (6.4dB) car environments.
Keywords
Adaptation model; Auditory system; Hidden Markov models; Noise measurement; Robustness; Speech; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743735
Filename
5743735
Link To Document