• DocumentCode
    1476469
  • Title

    Maximum-likelihood stochastic-transformation adaptation of hidden Markov models

  • Author

    Diakoloukas, Vassilis D. ; Digalakis, Vassilios V.

  • Author_Institution
    Tech. Univ. of Crete, Chania, Greece
  • Volume
    7
  • Issue
    2
  • fYear
    1999
  • fDate
    3/1/1999 12:00:00 AM
  • Firstpage
    177
  • Lastpage
    187
  • Abstract
    The recognition accuracy in previous large vocabulary automatic speech recognition (ASR) systems is highly related to the existing mismatch between the training and testing sets. For example, dialect differences across the training and testing speakers result in a significant degradation in recognition performance. Some popular adaptation approaches improve the recognition performance of speech recognizers based on hidden Markov models with continuous mixture densities by using linear transformations to adapt the means, and possibly the covariances of the mixture Gaussians. The linear assumption, however, is too restrictive, and in this paper we propose a novel adaptation technique that adapts the means and, optionally, the covariances of the mixture Gaussians by using multiple stochastic transformations. We perform both speaker and dialect adaptation experiments, and we show that our method significantly improves the recognition accuracy and the robustness of our system. The experiments are carried out with SRI´s DECIPHER speech recognition system
  • Keywords
    Gaussian processes; adaptive estimation; covariance analysis; hidden Markov models; maximum likelihood estimation; speech recognition; DECIPHER speech recognition system; SRI; continuous mixture densities; covariances; dialect adaptation experiments; hidden Markov models; large vocabulary automatic speech recognition; linear transformations; maximum-likelihood stochastic-transformation adaptation; mean; mixture Gaussians; multiple stochastic transformations; recognition accuracy; recognition performance; speaker adaptation experiments; testing sets; training sets; Automatic speech recognition; Automatic testing; Degradation; Gaussian processes; Hidden Markov models; Robustness; Speech recognition; Stochastic processes; System testing; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.748122
  • Filename
    748122