DocumentCode :
310572
Title :
Studies in transformation-based adaptation
Author :
Nagesha, Venkatesh ; Gillick, Larry
Author_Institution :
Dragon Systems Inc., Newton, MA, USA
Volume :
2
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
1031
Abstract :
This paper studies the use of transformation-based speaker adaptation in improving the performance of large vocabulary continuous speech recognition systems. We present a formulation of the adaptation procedure that is simpler than existing methods. Our experiments demonstrate that speaker normalization continues to be important even after significant amounts of speaker adaptation. An automatic clustering algorithm is compared to human expertise in sorting output distributions into collections that share the same transformation. We quantify improvements over standard Bayesian (by maximum a posteriori or MAP) adaptation in terms of (a) speed of adaptation, and (b) robustness to transcription errors. Finally, we discuss the use of speaker transformations in the training process
Keywords :
speech recognition; transforms; adaptation speed; automatic clustering algorithm; large vocabulary continuous speech recognition systems; output distributions; performance; robustness; speaker normalization; speaker transformations; training process; transcription errors; transformation-based speaker adaptation; Bayesian methods; Clustering algorithms; Computational efficiency; Humans; Linear regression; Loudspeakers; Robustness; Sorting; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.596116
Filename :
596116
Link To Document :
بازگشت