DocumentCode :
337478
Title :
Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition
Author :
Kannan, Ashvin ; Khudanpur, Sanjeev
Author_Institution :
Nuance Commun., Menlo Park, CA, USA
Volume :
2
fYear :
1999
fDate :
15-19 Mar 1999
Firstpage :
769
Abstract :
Two models of statistical dependence between the acoustic model parameters of a large vocabulary conversational speech recognition (LVCSR) system are investigated for the purpose of rapid speaker- and environment-adaptation from a very small amount of speech: (i) a Gaussian multiscale process governed by a stochastic linear dynamical system on a tree, and (ii) a simple hierarchical tree-structured prior. Both methods permit Bayesian (MAP) estimation of acoustic model parameters without parameter-tying even when no samples are available to independently estimate some parameters due to the limited amount of adaptation data. Modeling methodologies are contrasted, and comparative performance of the two on the Switchboard task is presented under identical test conditions for supervised and unsupervised adaptation with controlled amounts of adaptation speech. Both methods provide significant (1% absolute) gain in accuracy over adaptation methods that do not exploit the dependence between acoustic model parameters.
Keywords :
Bayes methods; Gaussian processes; speech recognition; statistical analysis; tree data structures; Bayesian estimation; Gaussian multiscale process; MAP estimation; Switchboard task; acoustic model parameters; adaptation speech; hierarchical tree-structured prior; large vocabulary conversational speech recognition; parameter dependence; rapid adaptation; statistical dependence; stochastic linear dynamical system; supervised adaptation; tree-structured models; unsupervised adaptation; Acoustic testing; Bayesian methods; Gaussian noise; Hidden Markov models; Loudspeakers; Speech processing; Speech recognition; Stochastic systems; System testing; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.759782
Filename :
759782