DocumentCode :
2942413
Title :
Language identification using phoneme recognition and phonotactic language modeling
Author :
Zissman, M.A.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA
Volume :
5
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
3503
Abstract :
A language identification technique using multiple single-language phoneme recognizers followed by n-gram language models yielded top performance at the March 1994 NIST language identification evaluation. Since the NIST evaluation, work has been aimed at further improving performance by using the acoustic likelihoods emitted from gender-dependent phoneme recognizers to weight the phonotactic likelihoods output from gender-dependent language models. We have investigated the effect of restricting processing to the most highly discriminating n-grams, and we have also added explicit duration modeling at the phonotactic level. On the OGI Multi-language Telephone Speech Corpus, accuracy on an 11-language, closed-set, language identification task has risen to 89% on 45-s utterances and 79% on 10-s utterances. Two-language classification accuracy is 98% and 95% for the 45-s and 10-s utterances, respectively. Finally, we have started to apply these same techniques to the problem of dialect identifications
Keywords :
acoustic signal processing; grammars; natural languages; speech processing; speech recognition; telephony; NIST language identification; OGI Multi-language Telephone Speech Corpus; acoustic likelihoods; dialect identification; duration modeling; gender-dependent language models; gender-dependent phoneme recognizers; language identification; n-gram language models; phoneme recognition; phonotactic language modeling; phonotactic likelihoods output; single-language phoneme recognizers; two-language classification accuracy; Acoustic emission; Computational efficiency; Electronic mail; Humans; Laboratories; NIST; Natural languages; Real time systems; Speech recognition; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479741
Filename :
479741
Link To Document :
بازگشت