DocumentCode :
294653
Title :
Speaker adaptation using combined transformation and Bayesian methods
Author :
Digalakis, Vassilios ; Neumeyer, Leonardo
Author_Institution :
Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
680
Abstract :
The performance and robustness of a speech recognition system can be improved by adapting the speech models to the speaker, the channel and the task. In continuous mixture-density hidden Markov models the number of component densities is typically very large, and it may not be feasible to acquire a large amount of adaptation data for robust maximum-likelihood estimates. To solve this problem, we propose a constrained estimation technique for Gaussian mixture densities, and combine it with Bayesian techniques to improve its asymptotic properties. We evaluate our algorithms on the large-vocabulary Wall Street Journal corpus for nonnative speakers of American English. The recognition error rate is comparable to the speaker-independent accuracy achieved for native speakers
Keywords :
Bayes methods; Gaussian processes; adaptive signal processing; hidden Markov models; parameter estimation; speech processing; speech recognition; transforms; American English; Bayesian method; Bayesian techniques; Gaussian mixture densities; adaptation data; algorithms; asymptotic properties; channel adaptation; constrained estimation technique; continuous mixture-density hidden Markov models; large-vocabulary Wall Street Journal corpus; maximum-likelihood estimates; native speakers; nonnative speakers; recognition error rate; speaker adaptation; speaker-independent accuracy; speech models; speech recognition system; system performance; system robustness; transformation methods; Bayesian methods; Error analysis; Hidden Markov models; Laboratories; Loudspeakers; Maximum likelihood estimation; Parameter estimation; Robustness; Speech recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479785
Filename :
479785
Link To Document :
بازگشت