DocumentCode :
2943081
Title :
Language identification using multiple knowledge sources
Author :
Parris, Eluned S. ; Carey, Michael J.
Author_Institution :
Ensigma Ltd., Chepstow, UK
Volume :
5
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
3519
Abstract :
Language identification experiments have been carried out on language pairs taken from seven of the languages in the OGI Multi-language Telephone Speech Corpus. This builds on previous work but introduces new techniques which are used to exploit the acoustic and phonetic differences between the languages. Subword hidden Markov models for the pair of languages are matched to unknown utterances resulting in three measures: the acoustic match, the phoneme frequencies and frequency histograms. Each of these measures gives 80 to 90% accuracy in discriminating language pairs. However these multiple knowledge sources are also combined to give improved results. Majority decision, logistic regression and a linear classifier were compared as data fusion techniques. The linear classifier performed the best giving an average accuracy of 89 to 93% on the pairs from the seven languages
Keywords :
hidden Markov models; natural languages; pattern classification; sensor fusion; speech recognition; acoustic differences; acoustic match; data fusion; frequency histograms; language identification experiment; language pair; linear classifier; logistic regression; majority decision; multiple knowledge sources; phoneme frequencies; phonetic differences; subword hidden Markov models; utterances; Acoustic measurements; Databases; Frequency measurement; Hidden Markov models; Histograms; Logistics; Natural languages; Neural networks; Speech recognition; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479745
Filename :
479745
Link To Document :
بازگشت