DocumentCode :
1939154
Title :
Robust speech recognition by normalization of the acoustic space
Author :
Acero, Alejandro ; Stern, Richard M.
Author_Institution :
Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
1991
fDate :
14-17 Apr 1991
Firstpage :
893
Abstract :
Several algorithms are presented that increase the robustness of SPHINX, the CMU (Carnegie Mellon University) continuous-speech speaker-independent recognition systems, by normalizing the acoustic space via minimization of the overall VQ distortion. The authors propose an affine transformation of the cepstrum in which a matrix multiplication perform frequency normalization and a vector addition attempts environment normalization. The algorithms for environment normalization are efficient and improve the recognition accuracy when the system is tested on a microphone other than the one on which it was trained. The frequency normalization algorithm applies a different warping on the frequency axis to different speakers and it achieves a 10% decrease in error rate
Keywords :
acoustic signal processing; speech analysis and processing; speech recognition; CMU; Carnegie Mellon University; SPHINX; acoustic space normalisation; affine transformation; algorithms; continuous speech recognition; environment normalization; error rate; frequency normalization; matrix multiplication; microphone; minimization; recognition accuracy; speaker-independent recognition systems; vector addition; vector quantisation distortion; Acoustic distortion; Acoustic testing; Cepstrum; Error analysis; Frequency; Microphones; Minimization methods; Robustness; Speech recognition; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
ISSN :
1520-6149
Print_ISBN :
0-7803-0003-3
Type :
conf
DOI :
10.1109/ICASSP.1991.150483
Filename :
150483
Link To Document :
بازگشت