• DocumentCode
    2932994
  • Title

    Batch, incremental and instantaneous adaptation techniques for speech recognition

  • Author

    Zavaliagkos, G. ; Schwartz, R. ; Makhoul, J.

  • Author_Institution
    Northeastern Univ., Boston, MA, USA
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    676
  • Abstract
    We present a framework for maximum a posteriori adaptation of large-scale HMM speech recognizers. In this framework, we introduce mechanisms that take advantage of correlations present among HMM parameters in order to maximize the number of parameters that can be adapted by a limited number of observations. We are also separately exploring the feasibility of instantaneous adaptation techniques. Instantaneous adaptation attempts to improve recognition on a single sentence, the same sentence that is used to estimate the adaptation. We show that sizable gains (20-40% reduction in error rate) can be achieved by either batch or incremental adaptation for large-vocabulary recognition of native speakers. The same techniques cut the error rate for recognition of non-native speakers by factors of 2 to 4, bringing their performance much closer to the native speaker performance. We also demonstrate that good improvements in performance (25-30%) are realized when instantaneous adaptation is used for recognition of non-native speakers.
  • Keywords
    adaptive signal processing; correlation methods; error statistics; maximum likelihood estimation; speech processing; speech recognition; HMM parameters; batch adaptation techniques; correlations; error rate reduction; incremental adaptation techniques; instantaneous adaptation techniques; large scale HMM speech recognizers; large vocabulary recognition; maximum a posteriori adaptation; native speakers; nonnative speakers; sentence; speaker performance; speech recognition; Degradation; Error analysis; Hidden Markov models; Large-scale systems; Parameter estimation; Smoothing methods; Speech recognition; State estimation; Testing; Vocabulary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479688
  • Filename
    479688