Title :
Language identification using multiple knowledge sources
Author :
Parris, Eluned S. ; Carey, Michael J.
Author_Institution :
Ensigma Ltd., Chepstow, UK
Abstract :
Language identification experiments have been carried out on language pairs taken from seven of the languages in the OGI Multi-language Telephone Speech Corpus. The study builds on previous work but introduces new techniques that exploit the acoustic and phonetic differences between the languages. Subword hidden Markov models for the pair of languages are matched to unknown utterances, resulting in three measures: the acoustic match, the phoneme frequencies and frequency histograms. Each of these measures gives 80 to 90% accuracy in discriminating language pairs. The multiple knowledge sources are then combined to give improved results. Majority decision, logistic regression and a linear classifier were compared as data fusion techniques. The linear classifier performed best, giving an average accuracy of 89 to 93% on the pairs from the seven languages.
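The fusion step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the score convention (positive favours language "A") and the weights are all hypothetical, chosen only to show how three per-measure outputs might be combined by majority decision or by a linear classifier.

```python
from collections import Counter


def majority_decision(votes):
    """Return the language label chosen by most of the individual measures."""
    label, _ = Counter(votes).most_common(1)[0]
    return label


def linear_fusion(scores, weights, bias=0.0):
    """Fuse per-measure scores with a weighted sum.

    Assumed convention: a positive score favours language "A",
    a negative score favours language "B".
    """
    total = bias + sum(w * s for w, s in zip(weights, scores))
    return "A" if total > 0 else "B"


# Hypothetical outputs of the three measures for one utterance:
# acoustic match, phoneme frequencies, frequency histograms.
votes = ["A", "B", "A"]
scores = [0.4, -0.9, 0.3]
weights = [1.0, 0.5, 1.0]  # illustrative values, not trained weights

print(majority_decision(votes))
print(linear_fusion(scores, weights))
```

With three measures and two candidate languages a majority vote can never tie, whereas the linear classifier can also take into account how strongly each measure favours a language. In practice the weights would be estimated from training data.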
Keywords :
hidden Markov models; natural languages; pattern classification; sensor fusion; speech recognition; acoustic differences; acoustic match; data fusion; frequency histograms; language identification experiment; language pair; linear classifier; logistic regression; majority decision; multiple knowledge sources; phoneme frequencies; phonetic differences; subword hidden Markov models; utterances; Acoustic measurements; Databases; Frequency measurement; Hidden Markov models; Histograms; Logistics; Natural languages; Neural networks; Speech recognition; Telephony;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479745