Title :
Batch, incremental and instantaneous adaptation techniques for speech recognition
Author :
Zavaliagkos, G. ; Schwartz, R. ; Makhoul, J.
Author_Institution :
Northeastern Univ., Boston, MA, USA
Abstract :
We present a framework for maximum a posteriori adaptation of large-scale HMM speech recognizers. In this framework, we introduce mechanisms that take advantage of correlations present among HMM parameters in order to maximize the number of parameters that can be adapted by a limited number of observations. We are also separately exploring the feasibility of instantaneous adaptation techniques. Instantaneous adaptation attempts to improve recognition on a single sentence, the same sentence that is used to estimate the adaptation. We show that sizable gains (20-40% reduction in error rate) can be achieved by either batch or incremental adaptation for large-vocabulary recognition of native speakers. The same techniques cut the error rate for recognition of non-native speakers by factors of 2 to 4, bringing their performance much closer to the native speaker performance. We also demonstrate that good improvements in performance (25-30%) are realized when instantaneous adaptation is used for recognition of non-native speakers.
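Illustrative sketch (not taken from the paper): the abstract does not give the update equations, so the code below shows only the textbook MAP re-estimate of a single Gaussian mean, in which a speaker-independent prior mean is interpolated with the adaptation data according to how much data was observed. The function names, the prior weight `tau`, and the list-based feature representation are all assumptions made for illustration; the correlation-based mechanisms described in the abstract are not modeled here. A second helper shows how a "factor of N" error cut relates to a relative reduction in percent.

```python
def map_adapt_mean(prior_mean, frames, posteriors, tau=10.0):
    """Textbook MAP re-estimate of one Gaussian mean (illustrative only).

    prior_mean : speaker-independent mean, a list of floats
    frames     : adaptation feature vectors, each a list of floats
    posteriors : occupation probability of this Gaussian for each frame
    tau        : prior weight (hypothetical default; larger trusts the prior more)
    """
    dim = len(prior_mean)
    occupancy = sum(posteriors)
    weighted_sum = [0.0] * dim
    for x, gamma in zip(frames, posteriors):
        for d in range(dim):
            weighted_sum[d] += gamma * x[d]
    # With no data the result is the prior mean; with much data it
    # approaches the posterior-weighted sample mean.
    return [
        (tau * prior_mean[d] + weighted_sum[d]) / (tau + occupancy)
        for d in range(dim)
    ]


def relative_error_reduction(baseline_error, adapted_error):
    """Fraction of the baseline error removed by adaptation."""
    return (baseline_error - adapted_error) / baseline_error
```

Under these assumptions, a Gaussian that saw no adaptation frames keeps its prior mean, which is why plain MAP adapts few parameters from limited data and why sharing information across correlated parameters is attractive. Cutting an error rate by a factor of 2 corresponds to a 50% relative reduction, and by a factor of 4 to a 75% relative reduction.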
Keywords :
adaptive signal processing; correlation methods; error statistics; maximum likelihood estimation; speech processing; speech recognition; HMM parameters; batch adaptation techniques; correlations; error rate reduction; incremental adaptation techniques; instantaneous adaptation techniques; large scale HMM speech recognizers; large vocabulary recognition; maximum a posteriori adaptation; native speakers; nonnative speakers; sentence; speaker performance; Degradation; Error analysis; Hidden Markov models; Large-scale systems; Parameter estimation; Smoothing methods; State estimation; Testing; Vocabulary
Conference_Title :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479688