Title :
Robust speech recognition by normalization of the acoustic space
Author :
Acero, Alejandro ; Stern, Richard M.
Author_Institution :
Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Several algorithms are presented that increase the robustness of SPHINX, the CMU (Carnegie Mellon University) continuous-speech speaker-independent recognition systems, by normalizing the acoustic space via minimization of the overall VQ distortion. The authors propose an affine transformation of the cepstrum in which a matrix multiplication perform frequency normalization and a vector addition attempts environment normalization. The algorithms for environment normalization are efficient and improve the recognition accuracy when the system is tested on a microphone other than the one on which it was trained. The frequency normalization algorithm applies a different warping on the frequency axis to different speakers and it achieves a 10% decrease in error rate
Keywords :
acoustic signal processing; speech analysis and processing; speech recognition; CMU; Carnegie Mellon University; SPHINX; acoustic space normalisation; affine transformation; algorithms; continuous speech recognition; environment normalization; error rate; frequency normalization; matrix multiplication; microphone; minimization; recognition accuracy; speaker-independent recognition systems; vector addition; vector quantisation distortion; Acoustic distortion; Acoustic testing; Cepstrum; Error analysis; Frequency; Microphones; Minimization methods; Robustness; Speech recognition; System testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150483