• DocumentCode
    294653
  • Title

    Speaker adaptation using combined transformation and Bayesian methods

  • Author

    Digalakis, Vassilios ; Neumeyer, Leonardo

  • Author_Institution
    Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    680
  • Abstract
    The performance and robustness of a speech recognition system can be improved by adapting the speech models to the speaker, the channel and the task. In continuous mixture-density hidden Markov models the number of component densities is typically very large, and it may not be feasible to acquire a large amount of adaptation data for robust maximum-likelihood estimates. To solve this problem, we propose a constrained estimation technique for Gaussian mixture densities, and combine it with Bayesian techniques to improve its asymptotic properties. We evaluate our algorithms on the large-vocabulary Wall Street Journal corpus for nonnative speakers of American English. The recognition error rate is comparable to the speaker-independent accuracy achieved for native speakers
  • Keywords
    Bayes methods; Gaussian processes; adaptive signal processing; hidden Markov models; parameter estimation; speech processing; speech recognition; transforms; American English; Bayesian method; Bayesian techniques; Gaussian mixture densities; adaptation data; algorithms; asymptotic properties; channel adaptation; constrained estimation technique; continuous mixture-density hidden Markov models; large-vocabulary Wall Street Journal corpus; maximum-likelihood estimates; native speakers; nonnative speakers; recognition error rate; speaker adaptation; speaker-independent accuracy; speech models; speech recognition system; system performance; system robustness; transformation methods; Bayesian methods; Error analysis; Hidden Markov models; Laboratories; Loudspeakers; Maximum likelihood estimation; Parameter estimation; Robustness; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479785
  • Filename
    479785