DocumentCode :
3294410
Title :
Language identification using shifted delta cepstra
Author :
Kohler, M.A. ; Kennedy, M.
Author_Institution :
Dept. of Defense, Fort Meade, MD, USA
Volume :
3
fYear :
2002
fDate :
4-7 Aug. 2002
Abstract :
A variety of speech identification technologies currently use Gaussian mixture models. Until recently, however, they were considered inferior to parallel phone recognition language modeling for identifying the language of a speaker. Experiments in the last year have shown that Gaussian mixture models can provide high performance language identification when shifted delta cepstra are used as the feature set. Not only can they achieve comparative or even superior performance to parallel phone recognition language modeling, Gaussian mixture models also require less computation. Performance can be further improved by altering the shifted delta cepstra parameters and the number of mixtures. The optimal parameter set varies depending on the languages to be identified. This paper describes a method for finding the optimal parameters for identifying a set of languages, specifies these parameters for a language identification task, and provides a performance comparison.
Keywords :
Gaussian distribution; speech recognition; Gaussian mixture models; language identification; optimal parameter set; optimal parameters; parallel phone recognition language modeling; shifted delta cepstra; speech identification technologies; Algorithm design and analysis; Cepstral analysis; Concurrent computing; Humans; Natural languages; Speech processing; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 2002. MWSCAS-2002. The 2002 45th Midwest Symposium on
Print_ISBN :
0-7803-7523-8
Type :
conf
DOI :
10.1109/MWSCAS.2002.1186972
Filename :
1186972
Link To Document :
بازگشت