DocumentCode :
323777
Title :
A study on speaker normalization using vocal tract normalization and speaker adaptive training
Author :
Welling, L. ; Haeb-Umbach, R. ; Zubert, X. ; Haberland, N.
Author_Institution :
Tech. Hochschule Aachen, Germany
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
797
Abstract :
Although speaker normalization is attempted in very different manners, vocal tract normalization (VTN) and speaker adaptive training (SAT) share many common properties. We show that both lead to more compact representations of the phonetically relevant variations of the training data and that both achieve improved error rate performance only if a complementary normalization or adaptation operation is conducted on the test data. Algorithms for fast test speaker enrolment are presented for both normalization methods: in the framework of SAT, a pre-transformation step is proposed, which alone, i.e. without subsequent unsupervised MLLR adaptation, reduces the error rate by almost 10% on the WSJ 5k test sets. For VTN, the use of a Gaussian mixture model makes obsolete a first recognition pass to obtain a preliminary transcription of the test utterance at hardly any loss in performance
Keywords :
Gaussian processes; adaptive systems; hidden Markov models; maximum likelihood estimation; signal representation; speech processing; speech recognition; Gaussian mixture model; HMM training; WSJ test sets; algorithms; automatic speech recognizers; error rate performance; error rate reduction; fast test speaker enrolment; maximum likelihood linear regression; phonetically relevant variations; pre-transformation step; speaker adaptive training; speaker normalization; test utterance; training data representation; transcription; vocal tract normalization; Acoustic testing; Error analysis; Frequency; Hidden Markov models; Maximum likelihood estimation; Maximum likelihood linear regression; Parameter estimation; Performance evaluation; Performance loss; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675385
Filename :
675385
Link To Document :
بازگشت