DocumentCode :
1117994
Title :
Gaussian Mixture Clustering and Language Adaptation for the Development of a New Language Speech Recognition System
Author :
Chatzichrisafis, Nikos ; Diakoloukas, Vassilios ; Digalakis, Vassilios ; Harizakis, Costas
Author_Institution :
Dept. of Electron. & Comput. Eng., Tech. Univ. Crete, Chania
Volume :
15
Issue :
3
fYear :
2007
fDate :
3/1/2007 12:00:00 AM
Firstpage :
928
Lastpage :
938
Abstract :
The porting of a speech recognition system to a new language is usually a time-consuming and expensive process since it requires collecting, transcribing, and processing a large amount of language-specific training sentences. This work presents techniques for improved cross-language transfer of speech recognition systems to new target languages. Such techniques are particularly useful for target languages where minimal amounts of training data are available. We describe a novel method to produce a language-independent system by combining acoustic models from a number of source languages. This intermediate language-independent acoustic model is used to bootstrap a target-language system by applying language adaptation. For our experiments, we use acoustic models of seven source languages to develop a target Greek acoustic model. We show that our technique significantly outperforms a system trained from scratch when less than 8 h of read speech is available.
Keywords :
Gaussian processes; natural language processing; pattern clustering; speech recognition; Gaussian mixture clustering; cross-language transfer; intermediate language-independent acoustic model; language adaptation; language speech recognition system; language-independent system; language-specific training sentences; target Greek acoustic model; Acoustical engineering; Application software; Costs; Databases; Linear discriminant analysis; Natural languages; Speech processing; Speech recognition; Training data; Vectors; Clustering methods; languages; speech recognition
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2006.885259
Filename :
4100669