Title :
Automatic language identification of telephone speech messages using phoneme recognition and N-gram modeling
Author :
Zissman, Marc A. ; Singer, Elliot
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Abstract :
The paper compares the performance of four approaches to automatic language identification (LID) of telephone speech messages: Gaussian mixture model classification (GMM), language-independent phoneme recognition followed by language-dependent language modeling (PRLM), parallel PRLM (PRLM-P), and language-dependent parallel phoneme recognition (PPR). These approaches span a wide range of training requirements and levels of recognition complexity. All approaches were tested on the development test subset of the OGI multi-language telephone speech corpus. Generally, system performance was directly related to system complexity, with PRLM-P and PPR performing best. On 45 second test utterances, average two language, closed-set, forced-choice classification performance reached 94.5% correct. The best 10 language, closed-set, forced-choice performance was 79.2% correct
Keywords :
Gaussian processes; identification; natural languages; speech recognition; telephony; GMM; Gaussian mixture model classification; N-gram modeling; OGI multilanguage telephone speech corpus; PPR; PRLM-P; automatic language identification; closed-set forced-choice performance; development test subset; language-dependent language modeling; language-dependent parallel phoneme recognition; language-independent phoneme recognition; parallel PRLM; phoneme recognition; recognition complexity; telephone speech messages; test utterances; training requirements; Automatic speech recognition; Cepstral analysis; Clustering algorithms; Covariance matrix; Detectors; Laboratories; Natural languages; Telephony; Testing; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389377